RECOMPRESSION: A SIMPLE AND POWERFUL TECHNIQUE FOR WORD 

EQUATIONS 

ARTUR JEZ 

INSTITUTE OF COMPUTER SCIENCE 

UNIVERSITY OF WROCLAW 

WROCLAW, POLAND 

Abstract. In this paper we present an application of a simple technique of local recompression, previ- 
ously developed by the author in the context of compressed membership problems [S] and compressed 
pattern matching [5], to word equations. The technique is based on local modification of variables 
(replacing X by aX or Xa) and iterative replacement of pairs of letters appearing in the equation by a 
£SJ ' 'fresh' letter, which can be seen as a bottom-up compression of the solution of the given word equation, 

t-H \ to be more specific, building an SLP (Straight-Line Programme) for the solution of the word equation. 

The interesting feature of this technique is that it, and its application, do not require any under- 
standing of the word equations nor of their solutions, which is rather uncommon in the area. 

Using this technique we give a new, independent and self-contained proofs of most of the known 
Q_) ■ results for word equations. To be more specific, the presented (nondeterministic) algorithm runs in 

^0 ' O(nlogn) space and in time polynomial in log JV, where N is the size of the length-minimal solution 

-—* , of the word equation. The presented algorithm can be easily generalised to a generator of all solutions 

*vj ■ of the given word equation (without increasing the space usage). Furthermore, a further analysis of 

the algorithm yields a doubly exponential upper bound on the size of the length-minimal solution, 
i i ■ The presented algorithm does not use exponential bound on the exponent of periodicity. Conversely, 

l_— J ' the analysis of the algorithm yields an independent proof of the exponential bound on exponent of 

Ph ! periodicity. 

^5 ■ We believe that the presented algorithm, its idea and analysis are far simpler than all previously 

O ' applied. Furthermore, thanks to it we can obtain a unified and simple approach to most of known 

results for word equations. 
«i ■ As a small additional result we show that for 0(1) variables (with arbitrary many appearances in 

the equation) word equations can be solved in linear space, i.e. they are context-sensitive. 
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m . 1. Introduction 

O- 

CN ' Word equations. Since the dawn of the computer science, the problem of word equations was one 

of the most intriguing in the field: given words U and V, consisting of letters (from T) and variables 

£> \ (from X) we are to check the satisfiability, i.e. decide, whether there is a substitution for variables, 

k> ■ which turns this formal equation into an equality of strings of letters. It is useful to think of a solution 

S as a homomorphism S : V U X — > T*, which is an identity on T. In the more general problem of 

solving the equation, we are to give representation of (all or some) solutions of the equation. 

The problem of satisfiability of word equations was first solved by Makanin [T3]. The proposed 
algorithm MakSAT transforms equations and large part of Makanin's work consists of proving that 
this procedure in fact terminates. While terminating, MakSAT complexity is very high. Over the years 
the algorithm was gradually improved: by Jaffar and independently Schulz to 4-NEXPTIME [23j 
Koscielski and Pacholski to 3-NEXPTIME [TDJ , by Diekert to 2-EXPSPACE (unpublished) and by 
Gutierrez to EXPSPACE [5]. It is worth mentioning that for 20 years no essentially different algorithm 
than MakSAT was proposed. On the other hand, as for today only a simple NP lower bound is known 
and it is widely believed that this problem is in NP. 

One of the key factors in the proof of termination, as well in later estimations of the complexity of 
the algorithm, was the estimation the upper bound on exponent of periodicity of the solution. Roughly 
speaking, the exponent of periodicity of a word w is the largest p such that w = w\u p W2 for some 
u/e, The original proof of Makanin gave a doubly exponential bound on the exponent of periodicity 
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of any length- minimal solution of word equations. Later it was shown by Koscielski and Pacholski 
that exponent of periodicity is at most exponential and that this bound is tight [lUj . 

A major independant step in the field was done by Plandowski and Rytter [22], who for the first 
time applied the notion of the compression to the solutions of the word equations. Roughly, they 
have shown that each length-minimal solution of the word equation is highly compressible, in the 
sense that using LZ compression (a popular practical standard for compression) we can represent each 
length-minimal solution of word equations (of size N) using a log iV-size encoding. This implies that 
also LZ-encoding of values of variables in such a solution has size at most log N. Thus, to solve the 
word equation it is enough to guess the LZ-encoding of S(X) for each variable X and verify that S(U) 
= S{V) under this substitution. The latter can be done using known (though recent at that time) 
polynomial methods for testing the equivalence of two SLPs [T7j. This yielded a new algorithm for 
word equations satisfiability, which works in (nondeterministic) polynomial in terms of log N and n. 
Unfortunately, at that time the only bound on iV followed from the original Makanin's algorithm, and 
it was triply exponential. This gave a 2-NEXPTIME algorithm, which was worse than EXPSPACE [3] 
published in the same year (though a little later). 

Later, Plandowski gave a doubly-exponential upper bound on the size of the minimal solution |18j . 
which immediately yielded a NEXPTIME algorithm PlaSat2EXP for the problem. This upper bound 
was obtained by a clever and careful analysis of the minimal solution using so-called D-factorisations, 
suggested by Mignosi. This algorithm also finds the solution, as it explicitly guesses solution's LZ- 
encoding. 

Soon after, another algorithm PlaSat, with a PSPACEU upper-bound was given by Plandowski |19j . 
This algorithm starts with a trivial equation e = e and has a set of operations that can be performed 
on the equation; so it can be seen as a rewriting system. The set of rewriting rules is quite simple 
and thus also the algorithm is easy to understand, moreover it is obvious that the rewriting rules 
preserve satisfiability. However the proof of completeness of this rewriting system (i.e. that it properly 
generates all satisfiable equations) is involved. It was based on usage of exponential expressions, 
which can be seen as a very simple compression, and on indexed factorisations of words, extending 
the already mentioned T>- factorisations. 

In some sense one can think that this result was obtained in stages, as PlaRytSAT, fuelled with 
theoretical results on P-factorisations, yielded PlaSat2EXP and this in in turn was upgraded to PlaSat, 
by exploiting better the interplay between the compression and factorisations. 

All mentioned algorithms have a little drawback: while they check satisfiability and can be modified 
to return some solution of the word equation, they do not solve it in the sense that they do not 
provide a representation of all solutions. This was fully resolved by Plandowski [2D], who gave an 
algorithm PlaSolve, which runs in PSPACqj and generates a compact representation of all (finite) 
solutions of a word equation. This algorithm uses an improved version of PlaSat, the PlaSatlmp, as 
subprocedure. The representation of the solutions is a directed multigraph, whose nodes are labelled 
with expressions and edges define substitutions for constants and variables. Such representation 
reduces many properties of word equations to reachability in graphs (which were exponentially larger), 
for instance the problem of finiteness of set of solutions is shown to be in PSPACE. 

Some research was also done in the restricted variants of word equations, most notably, there are 
polynomial-time algorithms for equations with only two variables [6j [2] . The variant with only one 
variable has almost-linear running time [3]; the special case of only one variable with 0(1) appearances 
in the equation has an optimal linear-time algorithm [11] , which works in a very simple computational 
model. 

Our contribution. In this paper, we present an application of a simple technique of local recompres- 
sion developed by the author and successfully applied to problems related with compressed data [51 [DJ . 
The idea of the technique is easily explained in terms of solutions of the equations rather than the 
equations themselves: consider a solution S(U) = S(V) of the equations U = V. In one phase we 
first list all pairs of different letters ab that appear as substrings in S(U) and S(V). For a fixed pair 
oh of this kind we greedily replace all appearances of ab in S(U) and S(V) by a new letter c. (A 



The presented algorithm has running time proportional to N, however, it can be extended so that it has the same 
mnnng-time bounds as the earlier PlaSat2EXP [5T] 

PlaSolve is implemented in PSPACE, but the generated representation can be exponential and thus only DEXPTIME 
running time was claimed. 
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slightly more complicated action is performed for pairs aa, for now we ignore this case to streamline 
the presentation of the main idea) . There are possible conflicts between such replacements for different 
types of pairs (consider string aba, in which we try to replace both pairs ab and ba), we resolve them 
by introducing some arbitrary order on types of pairs and performing the replacement for one type 
of pair at a time, according to the order. When all such pairs are replaced, we obtain another equal 
strings S'(U') and S'(V) (note that the equation U = V may have changed, and the new one is 
U' = V). Then we iterate the process. In each phase the strings are shortened by a constant factor, 
and so after C(log N) rounds we obtain a pair of trivial (i.e. consisting of a single letter) strings. Now, 
the original equation is solvable if and only if the obtained letters are the same. 

The presented method has some variants, for instance, the pairs that appear seldom are not com- 
pressed, pairs that do not overlap are compressed simultaneously etc. The respective variants, however, 
are always based on the general idea and the modifications are introduced to reach some specific goal. 

The most problematic part of this idea is that it performs the operation on the solutions, which 
can be large. If we were to simply guess the solution and then perform the compressions, this would 
have running time polynomial in N, which is not acceptable. However, we completely circumvent the 
problem, by performing the compression directly on the equation (the recompression): the pairs ab 
appearing in the solution are identified using only the equation and the compression of the solution 
is done implicitly, by compressing the constants in the equations. However, not all pairs of letters 
can be compressed in this way, as some of them appear on the 'crossing' between a variable and a 
constant: consider for instance S(X) = ab, a string of symbols Xc and a compression of a pair be. 
This is resolved by local decompression part of the method: when trying to compress the pair be in 
the example above we first replace X by Xb (implicitly changing S(X) from ab to a), obtaining the 
string of symbols Xbc, in which the pair be can be easily compressed. 

By simple calculations it can be shown that this method: 

• transforms solvable equations to solvable equations (for proper nondeterministic choices) 

• transforms unsolvable equations to unsolvable equations (for all nondeterministic choices) 

• does not introduce new variables 

• in each phase shortens each string (of letters) by a constant factor 

• in one phase introduces only a linear number of new letters to the equation 

In this way, correctness easily follows and both the 0(logNpo\y(n)) time and PSPACE bounds hold. 

Example 1. Consider an equation aXca = abYa with a solution S(X) = baba and S(Y) = abac. In 
the first phase, the algorithm wants to compress the pairs ab, ca, ac, ba in this order. To compress 
ab, it replaces X with bX, thus changing the substitution into S(X) = aba. After compression we 
obtain equation a'Xca = a'Ya. Notice, that this implicitly changed to solution into S(X) = a' a and 
S(Y) = a'ac To compress ca (into c'), we replace Y by Yc, thus implicitly changing the substitution 
into S(Y) = a'a. Then, we obtain the equation a' Xc' = a'Yc' with a solution S(X) = a'a and 
S(Y) = a'a. The remaining pairs no longer appear in the equations, and so we can proceed to the 
next phase. 

The main features of the presented technique is that, at the same time: it is easy to state and 
apply, its proof of correctness is simple and straightforward, only basic properties of word equations 
and strings are used in the design, application and analysis. The last property seems to be the 
most surprising, as in order to apply the technique, no understanding of the word equations and its 
solutions is actually needed. This is completely different than the approaches based on Makanin's 
approach [TdJ [2J1 QUI E] and Plandowski's constructions [HI 121 [2D] ; however, the PlaRytSAT [2J 
shared this treat. 

Results. Using the technique of local recompression we give a (nondeterministic) algorithm for testing 
satisfiability of word equations that works in time 0(logNpo\y(n)) and in 0(nlogn) space. Further- 
more, for 0(1) variables a more detailed analysis, the space consumption can be lowered to 0(n), thus 
showing that word equations are context-sensitive for a constant number of variables. 

The presented algorithm and its analysis are stand-alone, as they does not assume any (non-trivial) 
properties of the solutions of word equations. To the contrary, it supplies an easy proof of doubly- 
exponential upper bound of Plandowski [18] on lengths of lenght-minimal solutions as well as giving 
a new proof of exponential exponent of periodicity (though slightly weaker than the one presented by 
Koscielski and Pacholski |lUj). 
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The presented method can be easily modified, so that it can be used as a subprocedure in an 
algorithm generating a representation of all solutions, similarly as PlaSatlmp in PlaSolve. The repre- 
sentation provided by our algorithm is similar to representation provided by PlaSolve, i.e. a directed 
multigraph with edges representing substitutions. Then the algorithm for testing satisfiability is used 
to find out whether there is an edge between two given nodes and what is the substitution labelling 
it. The whole modification to our algorithm consists of replacing non-deterministic guesses of lengths 
of strings by guessing the arithmetical relation that these lengths satisfy. This can be compared with 
the subprocedure PlaSatlmp, which in its author's own words a 'nontrivial modification' of the PlaSat 
for testing the equations satisfiability. 

Presentation. We start off with presenting a recompression-based algorithm for word equations, in 
Section [3J Firstly, we shall describe only its basic properties, which are needed to show that it works 
in PSPACE and has 0(log iVpoly(n)) (nondeterminsitic) running time. More involved definitions as 
well as results are given in the following sections. To be more precise, in Section H] we analyse in 
more detail the structure of maximal repetitions of one letter in solutions of word equations. This 
allows reduction of space consumption to O(nlogn) and is essentially used in following sections. Using 
these results and a special encoding of letters we show that for 0(1) variables the space consumption 
can be further lowered to linear one, this is presented in Section [5j Then in Section [6J we recall 
the classification of solutions, given by Plandowski |2(J| . and related notions. Using this classification 
we generalise the main notions and algorithm to a generator of all solutions, see Section Lastly, 
in Section [51 we show that a more detailed analysis of the algorithm also yields alternative (simple) 
proofs of exponential bound on the exponent of periodicity and double exponential bound on the size 
of the length minimal solutions 

Comparison with approaches to word equations. The presented method and the obtained algorithm is 
independent from all previously known algorithms for word equations, i.e. from original MakSAT and 
its variants, from PlaRytSAT (and its variant PlaSat2EXP), from PSPACE algorithm PlaSat as well 
as its modification PlaSatlmp. In fact, the only algorithm, with which it can be somehow compared, 
is the LZ-based PlaRytSAT [22| . The key difference was that Plandowski and Rytter showed that a 
length-minimal solution has a short LZ-representation and then explicitly guessed and verify it. Thus 
their solution, in some sense, was 'global' (as it guessed the whole solution in one go) and based on 
solutions' properties. The novelty and importance of the here proposed method is that it does not 
use properties of the solutions and that it is very 'local', in the sense that it does not try to build the 
solution in one go and it modifies the equations and variables locally. In particular, in this was we are 
working with an SLP encoding of the solution, which is easier in handling than the LZ-representation. 
Lastly the presented algorithm uses only a very limited variant of exponent of periodicity, when the 
strings in question consist only of repetitions of a single letter. In such a case an exponential bound 
is easy to obtain. This makes the presented algorithm somehow similar to PlaRytSAT, which does not 
use at all the bound on exponent of periodicity. 

Related techniques. While the presented method of recompression is relatively new, some of its ideas 
and inspirations goes quite back. This technique was developed in order to deal with fully compressed 
membership problem for NFA and the previous work on this topic by Mathissen and Lohrey [T3] already 
implemented the idea of replacing strings with fresh letters as well as modifications of the instance 
so that this is possible and treated maximal blocks of a single letter in a proper way. However, the 
replacement was not iterated, and the newly introduced blocks could not be further compressed. 

The idea of replacing short strings by a fresh letter and iterating this procedure was used by 
Mehlhorn et. al [16], in their work on data structure for equality tests for dynamic strings (cf. also 
an improved implementation of a similar data structure by Brodal et al. [T]). They viewed this 
process as 'hashing'. In particular their method can be straightforwardly applied to equality testing 
for SLPs, yielding a nearly cubic (using data structure of Mehlhorn et. al [16]) or even nearly quadratic 
algorithm (using data structure of Brodal et al. [1]), as observed by Gawrychowski [3|. However, the 
inside technical details of the construction makes more involved extensions problematic: while this 
method can be used to build 'canonical' SLPs for the text and the pattern, there is no apparent way 
to control how these SLPs actually look like and how do they encode the strings. 

A similar technique, based on replacement of pairs and blocks of the same letter was proposed 
also by Sakamoto [23] in the context of constructing a smallest SLP generating a given word. His 
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algorithm was inspired by the RePair algorithm [13], which is a practical grammar-based compressor. 
It possessed the important features of the method: iterated replacement of pairs and blocks, phases 
(i.e. ignoring letters recently introduced). However, the analysis that stressed the modification of the 
variables (nonterminals) was not introduced and it was done in a more crude way. The author would 
like to note that (unfortunately) he became aware of the string of research started by Skamamoto only 
recently. 

Citing conventions. As this paper aims at being stand alone, many lemmata known from the literature, 
are supplied with proofs (though sometimes different than the original ones). Thus, in order to 
distinguish these two types of results, whenever a theorem/lemma has a citation, it means that it was 
shown before, perhaps in a slightly different variant. Otherwise, the theorem/lemma is new. 

2. Main notions and techniques: local compression 

Let us formalise the main notions. By V we denote the set of letters appearing in the equation 
U = V or are used for representation of compressed strings. The set X denotes a set of variables. The 
equation is written as U = V, where U, V £ (r U X)*. By \U\, \V\ we denote the length of U and 
V , n denotes the length the input equation, n v denotes the number of appearances of variables in the 
input equation. 

A substitution is a morphism S : A'ur — > T* , such that S(a) = a for every a £ I\ Each substitution 
is naturally extended to (XLiT)*. The name represents the intuitive meaning that substitution simply 
replaces variables by (some) strings. A solution of an equation U = V is a substitution S, such that 
S(U) = S(V). Clearly, some solutions are 'smaller' than other and we are naturally interested in 
the 'smallest': We say that a solution S is a length-minimal, if for every solution S' it holds that 
\S(U)\<S'(U). 

Operations. In essence, the presented technique is based on performing two operations on S(U) and 
S(V), consider the first one: 

pair compression of ab: For two different letters ab appearing in S(U) replace each of ab in 
S(U) and S(V) by a fresh letter c. 

The compression of pair aa is ambiguous (consider pair aa and a string aaa) and thus problematic, 
we need a better notion. For a letter a £ T we say that a is a a's maximal block of length £, if there 
exist two letters x, y G T, where x ^ a ^ y and xa £ y is a substring of S(U), or a e y is a prefix of S(U) 
or xa e is a suffix of S(U). We refer to a's (.-block for shortness. Now, we can introduce the second 
operation performed on the solutions: 

block compression for a: For a letter a appearing in S(U) and for each maximal block a? 
replace all a e s in S(U) and S(V) by a fresh letter ct£. 

The lengths of the maximal blocks can be upper bounded using the well-known exponential bound 
on exponent of periodicity: 

Lemma 1 (Exponent of periodicity bound |10|). If solution S is length-minimal and u/ for w ^ e is 
a substring of S(U), then £ < 2 cn for some constant < c < 2. 

We shall use exponent of periodicity bound only to estimate the lengths of the maximal blocks (i.e. 
restrict w to single letters in the above definition), and in such a case the proof becomes substantially 
easier then the general one, see Section Furthermore, an alternative approach, which does not need 
the exponent of periodicity at all, is also possible, see Section [H 

As our algorithm runs in PSPACE, it may introduce a large number of 'fresh letters', and so if 
we insist that each of them is in fact different, this becomes problematic. However, it is enough to 
assume that a 'fresh letter' does not appear in the equation: after all, even if it appeared in some 
other iteration, this is completely irrelevant. 

Remark 1. Word Eq Sat introduces new letters to the instance, replacing pairs of letters or maximal 
blocks of one letter. We insist that these new symbols are called and treated as letters. On the 
other hand, we can think of them as non-terminals of a context-free grammar (to be more specific, of 
so-called SLP): if c replaced ab, then this corresponds to a production c — > ab, similarly, a£ — > a . In 
this way we can think that WordEqSat builds a context-free grammar (an SLP) generating S(U) as a 
unique word in the language. 
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Types of pairs and blocks. Both pair compression and block compression (however they are imple- 
mented) shorten S(U) (and S(V)), which gives the main foundation for this technique. On the other 
hand, sometimes it is hard to perform these operations: for instance, if we are to compress a pair ab 
and aX appears in U, moreover, S(X) begins with b, then the compression is problematic, as we need 
to somehow modify S(X). The following definition allows distinguishing between pairs (blocks) that 
are easy to compress and those that are not. 

Definition 1 (cf. [3 [9]). Given an equation U = V and a substitution S and a substring u € T + of 
S(U) (or S(V)) we say that this appearance of u is 

• explicit, if it comes from substring u of U (or V , respectively) 

• implicit, it it comes from S(X) for some variable X 

• crossing otherwise. 

A string u is crossing (with respect to a solution S) if it has a crossing appearance and non-crossing 
(with respect to a solution S) sotherwise. 

We say that a pair of ab is a crossing pair (with respect to a solution S), if ab has a crossing 
appearance. Otherwise, a pair is non-crossing. Unless explicitly stated, we consider crossing/non- 
crossing pairs ab in which a ^ b. Similarly, a letter a € V has a crossing block, if there is a maximal 
block of a which has a crossing appearance. This is equivalent to a (simpler) condition that aa is a 
crossing pair. 

Compression of noncrossing pairs is easy, so is block compression when a has no crossing block. In 
other cases, the compression seems difficult. 

Visible lengths of blocks. We say that a e is visible in S (or £ is a visible length of a block in S), if there 
is an appearance of the o's ^-block that is explicit or crossing or it is a prefix or suffix of some S(X); 
we say that £ is a visible length for a if there is a maximal block a . The following lemma shows that 
in length-minimal solutions all lengths of maximal blocks are visible. 

Lemma 2 (cf. [221 Lemma 6]). Let S be a length-minimal solution ofU = V. 

• If ab is a substring of S{U), where a^b, then ab is an explicit pair or a crossing pair. 

• If a k is a maximal block in S(U) then a has an explicit appearance in U or V and there is a 
visible appearance of a k . 

Proof. Suppose that ab has only implicit appearances. Consider S': S'(X) is S(X) with all abs 
removed. Since all appearances of ab in S(U) and S(V) are implicit, S'(U) (S'(V)) is obtained from 
S(U) (S(V), respectively), by removing all pairs ab. Hence S'(U) = S'(V), i.e. S' is a solution and it 
is clearly shorter than S, contradiction. 

Similar argument shows that if a appears in S(U) then it has an explicit appearance in U or V. 

To streamline the rest of the presentation and analysis, in the remainder of the proof assume that 
both U and V begin and end with a letter and not a variable; this is easy to achieve by prepending $ 
and appending $' to both sides of the equation. Alternatively, the cases with variables beginning or 
ending U or V can be handled in the same way, as the general case. 

Consider a maximal a block a , , for £ > in S(U) and the letter preceding (succeeding) it, say b 
and c, respectively; by the assumption that U and V begin and end with a letter, such b and c always 
exist. Consider the appearances of ba e c in S(U) and S(V). Since b ^ a ^ c, these appearances cannot 
have overlapping a's (though, if b = c, these letters can overlap for different appearances). We want 
to show that one of these appearances is crossing or explicit. In such case the corresponding a e proves 
that £ is a visible length, which ends the proof. 

So suppose that none of these appearances is crossing nor explicit. Consider S': defines S'(X) 
as S(X) with each ba e c replaced with be. This operation is well defined, as the a 1 blocks are non- 
overlapping. As in the first case it can be shown that S' is a solution (since all ba l c are implicit), 
which contradicts the assumption that S is length-minimal. □ 

Compression of noncrossing pairs and blocks. Intuitively, when ab is non-crossing, each of its appear- 
ance in S(U) is either explicit or implicit. Thus, to perform the pair compression of ab on S(U) it is 
enough to separately replace each explicit pair ab in U and change each ab in S(X) for each variable 
X. The latter is of course done implicitly (as S(X) is not written down anywhere). The appropriate 
Algorithm Q] is given below. 
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Algorithm 1 PairCompNCr(a, b) Pair compression for a non-crossing pair 

1: let c G r be an unused letter 

2: replace each explicit ab in U and V by c 

Similarly when none block of a has a crossing appearance, the a's blocks compression consists simply 
of replacing explicit a blocks. 

Algorithm 2 BlockCompNCr(a) Block compression for a letter a with no crossing block 



for each explicit a's £-block appearing in U or V do 
let a,£ € r be an unused letter 
replace every explicit a's ^-block appearing in U or V by ai 



In order to show the correctness of those two procedures, we need to first introduce some terminology 
and notation. 

Preserving satisfiability and unsatisfiability. We say that a nondeterminsitic procedure preserves un- 
satisfiability, when given a unsatisfiable word equation U = V it cannot transform it to a satisfiable 
one, regardless of the nondeterministic choices; such a procedure preserves satisfiability, if given a 
satisfiable equation U = V for some nondeterminsitic choices it returns a satisfible equation U' = V. 
Observe, that a composition of procedures preserving satisfiability (unsatisfiability) also preserves 
satisfiability (unsatisfiability) . 

A procedure that preserves satisfiability implements pair compression of ab for S, if given an equa- 
tion U = V with a solution S, for some nondeterministic choices it returns equation U' = V with a 
solution S', such that S'(U') is obtained from S(U) by replacing each ab by c; similarly we say that a 
procedure implements blocks compression of a for S. 

Observe that a very general class of operations preserve unsatisfiability: 

Lemma 3. Replacing occurrences of variable X with wXv for arbitrary w,v 6P preserves unsatis- 
fiability; replacing an occurrence of a word w £ T + with a fresh letter c preserves unsatisfiability. 

Proof. In the first case, if S' is a solution of U' = V then S defined as S(X) = wS'(X)v and 
S(Y) = S'(Y) otherwise is a solution of U = V. 

In the second case, if S' is a solution of U' = V then S obtained from S' by replacing each c with 
wisa solution of U = V . □ 

Properties of PairCompNCr and BlockCompNCr. Now we are ready to show properties of PairCompNCr(a, 6) 
and BlockCompNCr(a). 

Lemma 4. PairCompNCr(a, b) preserves the unsatisfiability, when ab is a non-crossing pair in an equa- 
tion U = V (for some solution S) then it preserves satisfiability and implements the pair compression 
of ab for S. 

Similarly, BlockCompNCr(a) preserves unsatisfiability and when a has no crossing blocks in U = V 
(for some solution S) it preserves satisfiability and implements the block compression of a for S. 

Proof. From Lemma [3] it follows that BlockCompNCr(a) preserves unsatisfiability. 

Suppose that U = V has a solution S such that ab is a noncrossing pair. Define S": S'(X) is 
equal to S(X) with each ab replaced with c (where c is a new letter). Consider S(U) and S'(U'). 
Then S'(U') is obtained from S(U) by replacing each ab: the explicit appearances of ab are replaced 
by PairCompNCr(a, b), the implicit ones are replaced by the definition of <S" and by the assumption 
there are no crossing appearances. The same applies to S(V) and S'iV'). Hence S'{U') = S'(V) and 
concludes the proof in this case. 

The proof for the block compression follows in the same way. □ 

Crossing pairs and blocks compression. The algorithms presented in the previous section cannot 
be directly applied to crossing pairs or to compression of a's blocks that have crossing appearances. 
To circumvent the problem, we modify the instance: if a pair ab is crossing because there is a variable 
X such that S(X) = bw for some word w and a is to the left of X, it is is enough to change S, so that 
S(X) = w; similar action is applied to variables Y ending with a and with b to the right. 
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This idea can be employed much more efficiently: consider a partition of T into 1^ and T r . The 
'left-popping' from each variable a letter from T r and 'right-popping' a letter from T^ guarantees that 
each pair ab G T(T r is non-crossing. Since pairs from TiT r do not overlap, after the preprocessing they 
can be compressed in parallel. Furthermore, it can be shown that for appropriate choice of Ti and T r 
a constant fraction of pairs from S(U) is of the form T(T r . 

Algorithm 3 Pop(r £ ,r r ) 

1: for X G X do 

2: let b be the first letter of S(X) > Guess 

3: if b G T r then 

4: replace each X in U and V by aX > Implicitly change S(X) = aw to S(X) = w 

5: if S(X) = e then > Guess 

6: remove X from U and V 

7: > Perform a symmetric action for the last letter 

Lemma 5. The Pop(r£,r r ) preserves satisfiability and unsatisfiability. 

Furthermore, if S is a solution of U = V then for some nondeterministic choices the obtained 
U' = V has a solution S' such that S'(U') = S(U) and for each pair ab from TiT r is non-crossing 
(with regards to S'). 

Proof. From Lemma [3] it follows that Pop(r^,r r ) preserves unsatisfiability. 

Conversely, suppose that U = V has a solution S. Let Pop(r£,r r ) always guesses according to 
S, i.e. in line El it guesses b as the first letter of S(X), and similarly a as the last letter of S(X), 
finally it removes X, when S(X) = e. Suppose that b £ T r and a G T^. Consider S"(X) defined as 
aS'(X)b = S(X) (when S(X) = a then S'(X) = e). It is easy to observe that S(U) = S'(U'), similarly 
S(V) = S'(V), hence S' is a solution of U' = V' . The cases in which b ^ T r or a £ T^ are done in the 
same way (for instance, when b ^T r and a 6^ then aS'(X') = S(X)). 

It is left to show that U' = V' each pair ab G T(T r is noncrossing with respect to such defined S'. 
Assume for the sake of contradiction that ab is crossing with respect to 5" of U' = V' . There are two 
cases 

a is to the left of some variable X and the first letter of S(X) is b: Since a G Fg, then 
Pop did not popped a letter a from X in line|U Hence the first letter of S(X) and S'(X) are 
the same. However, as in line H] the letter was not popped from X, the first letter of S(X) is 
not in r r , while the first letter of S'(X) is, contradiction 
b is to the right of some variable X and the last letter of S(X) is a: This case is sym- 
metric to the previous one. 
This ends the case inspection. Hence ab after the loop in line [5] is noncrossing with respect to S' . Note 
that for appropriate choices, all pairs ab in T^r,. become noncrossing. □ 

Now the presented procedures can be merged into one procedure that turns crossing pairs into 
noncrossing ones and then compresses them, effectively compressing crossing pairs. 

Algorithm 4 PairComp(r^,r r ) Turning crossing pairs from TiT r into non-crossing ones and com- 
pressing them 



run Pop(r£,r r ) 
for ab G r^r r do 

run PairCompNCr(a,6) 



Lemma 6. For any solution S PairComp(r£,r r ) preserves satisfiability and unsatisfiability and it 
implements the pair compression of each pair ab G TeT r . 

Proof. All subprocedures preserve the unsatisfiability, and so also PairComp(r£,r r ) preserves it. 

Concerning preserving the satisfiability and the implementation of the pair compression, observe, 
that by Lemma[l]each of PairCompNCr(a, b) implements the pair compression, when ab is noncrossing. 
Furthermore, by Lemma for appropriate choices after Pop(r£,r r ) the obtained equation U' = V 
has a solution S' such that S(U) = S'(U') and each ab G r^r r is noncrossing with regards to S", which 
concludes the claim. □ 
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The problems with crossing blocks can be solved in a similar fashion: a has a crossing block, if aa 
is a crossing pair. So we 'left-pop' a from X until the first letter of S(X) is different than a, we do the 
same with the ending letter b. This can be alternatively seen as removing the whole o-prefix (a-suffix, 
respectively) from X: suppose that S(X) = a i wb r , where w does not start with a nor end with b. 
Then we replace each X by a Xb r implicitly changing the solution to S(X) = w, see Algorithm [5j 



Algorithm 5 CutPrefSuff Cutting prefixes and suffixes 



for X G X do 

let a, b be the first and last letter of S(X) 

guess £x > 1) fx > > S(X) = a £x wb rx , where w does not begin with a nor end with b 

> If S(X) = a ix then r x = 



replace each X in U and V by a^ x Xb rx 

if S(w) = e then 

remove X from U and V 



> a ix , 6 rjf is stored in a compressed form, 



> implicitly change S(X) = a x wb rx to S(X) 



w 



> Guess 



Lemma 7. CutPrefSuff preserves unsatisfiability. For a solution S ofU = V and appropriate nonde- 
terministic choices it returns an equation U' = V that has a solution S' such that S(U) = S'(U') and 
U' = V' has no crossing blocks with respect to S' . 

Proof. Prom Lemma [3] we obtain that CutPrefSuff preserves unsatisfiability. 

We present the proof in the case when S(X) ^ a x for each variable, the argument in the other 
case is similar. 

Suppose that U = V has a solution S. Then let CutPrefSuff guess according to S, i.e. let £x > 1 



and rx > 1 be guessed so that S(X) 



~wxb Tx , where wx does not begin with a nor end with b 



Define S'(X) = wx- It is easy to see that S(U) = S'(U') and S(V) = S'(V'), in particular, S' is a 
solution of U' = V' . Furthermore, observe that as the first letter of wx is not a and the last is not b, 
there are no crossing blocks in U' = V with respect to S'. □ 

The CutPrefSuff allows defining a procedure BlockComp that compresses maximal blocks of all 
letters, regardless of whether they have crossing blocks or not. 



Algorithm 6 BlockComp Compressing blocks of a 



run CutPrefSuff 
for each letter a € T do 
BlockCompNCr(a) 



o Removes crossing blocks of a 



Lemma 8. BlockComp weakly unsatisfiability. 

For a solution S of U = V , for appropriate nondeterministic choices it preserves the satisfiability 
and implements the block compression (of all letters a). 

Proof. The proof is similar to the proof of Lemma As BlockComp is a composition of operations 
preserving unsatisfibility, it also preserves unsatisfiability. 

So suppose that U = V has a solution S. By Lemma [7] for appropriate nondeterministic choices, 
after CutPrefSuff in line[TJ the obtained (intermediate) equation U' = V has a solution S' such that 
S(U) = S'(U') and S(V) = S'(V') and there are no crossing blocks with respect to S' in U' = V . 
Then, by LemmaHl each of the BlockCompNCr(a) preserves satisfiability and implements the a blocks 
compression. As blocks of different letters are disjoint, this means that the loop in line [5] implements 
the blocks compression for each letter a € T. □ 



3. Main algorithm, its time and space consumption, solutions' size 
Now, the algorithm for testing satisfiability of word equations can be conveniently stated. 
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Algorithm 7 WordEqSat Checking the satisfiability of a word equation 



while \U\ > 1 or \V\ > 1 do 
BlockComp 

Letters 4— the set of letters present in U or V 
for i ^— 1 . . 2 do 

guess partition of Letters into Lettersi and Letters2 
PairComp(Lettersi, Letters^) 
Solve the problem naively > With sides of length 1, the problem is trivial 



We refer to one iteration of the main loop in WordEqSat as one phase. Observe that one phase of 
WordEqSat is executed in (nondeterministic) poly(|C/| + |V|) time. 

The properties of WordEqSat are summarised in the following theorem. 

Theorem 1. WordEqSat nondeterministically verifies the satisfiability of word equations. It can verify 
an existence of a length-minimal solution of length N in C(poly(n) logiV) time and 0(n 2 ) space. For 
appropriate choices, the stored equation has length 0(n). 

The analysis of the space consumption is done in Lemma El of time consumption in Lemma [TU] 
while the correctness is shown in Lemma [TTJ Furthermore, it is shown in the following section that 
the additional space consumption can be bounded by O(nlogn). 

Lemma 9. For appropriate nondeterministic choices, the equations stored by (successful) computation 
of WordEqSat are of length 0{n), the additional computation performed by WordEqSat use 0(n 2 ) 
space. 

Furthermore, for appropriate nondeterministic choices, the number of phases is at most C(logn + 

Proof. For the purpose of this proof let a symbol be either a € T, or a , where a € T and £ = 0(2 cn ), 
for constant c from Lemma [TJ Let us first calculate, how many symbols are introduced into the 
equation in one round: BlockComp is run once and it runs (also once) CutPrefSuff, which introduces 
two symbols per variable appearance; PairComp is run 2 times, each time it runs Pop which introduces 
at most two symbols per variable appearance. Hence, in one round, at most 6n v letters are introduced 
into the equation. 

On the other hand, the main task of the whole algorithm is compression: it can be shown that for 
appropriate choices large fraction of letters in U = V are compressed. 

Claim 1. Let U = V has a solution S. For appropriate choices, the equation U' = V obtained at the 
end one phase has a solution S' such that 

• 1/6 of letters in S(U) (rounding down) is compressed in S'(U'). 

• at least (\U\ + \V\ — §n„ — §)/6 of letters in U or V are compressed in U' or V'\ 

This can be used to show (inductively) that the length of U = V is at most 75n: clearly this bound 
holds for the input instance, which is length n. For the inductive step consider that there are at most 
Qn v symbols introduced into U' and V (some of them might be compressed later). On the other hand, 
by ClaimUl the number of original letters of U and V decreased by at least (\U\ + |V| — 3ra„ — 2)/12. 
Hence, 



(1) \U'\ + \V'\ < \U\ + \V\ - (\U\ + \V\ - ^n v - |)/12 + 6n t 



< 


\u\ + 


v\ 


-(\u\ 


+ \v 


5 

n 

3 


< 


> 


+ 


\v\) + 


3 

— n 
12 


4- 6n 


< 


11 7^ 

12 75n 


+ 


rV 5n 







<75n . 

Note, that this is the number of symbols, and not letters. However, each symbol representing a is 
compressed into a single letter before the end of the phase, so the given bound holds for the number 
of letters as well. 

As each symbol representing a e for some £, by exponent of periodicity (Lemma [T]), £ can be encoded 
using 0(n) bits, which gives 0{n 2 ) space consumption for symbols in total. 



RECOMPRESSION: A SIMPLE AND POWERFUL TECHNIQUE FOR WORD EQUATIONS 11 

Also, we need to take into the account the space consumption of variables: the number of variables 
is at most n v < n, (as WordEqSat does not introduce new variable in to the equation) and so they 
also fit in 0(nlogn v ) bits; since the number of letters is at most linear, we can encode each letter on 
at most log n bits. 

Observe now that calculation similar to the one in ([JJ shows that if the equation U = V has length 
larger than 15071^,, its length drops by a constant factor in a phase. Hence, after at most O(logra) 
phases the length of the equation is reduced to 0(n v ). We can imagine that we restart WordEqSat 
for this instance. Since the length of the equation will not exceed cn v for some constant c and the 
accepting computation clearly does not have loops, we obtain that the number of phases is at most 
C(logn + (cn v ) criv ) = C(logn + n£ n ") for some larger constant c'. 

It remains to give the proof of Claim [TJ 

proof of Claim [TJ We first show the first property. Divide S(U) into three-letters segments (ignore the 
last, partial segment). Consider a random partition of Letters into Letters± and Letters2, each letters 
goes into the part of the partition with probability 1/2. Take any segment appearing in S(U), let it 
be abc. We show that with probability at least 1/2 at least one letter in this segment is compressed. 
If any of those letters is equal to its neighbouring letters, then it is compressed by Lemma 

So suppose that none of these letters is the same as its neighbouring letters, in particular, they are 
not compressed by BlockComp. Consider a random partition of Letters into Lettersi and Letters2, each 
letter goes to one part with probability 1/2. There is a compression inside abc if ab £ Letters\Letters2 
or be S Letter s\Letters2- Each of those events has probability 1/4 and they are disjoint, hence the 
compression appears with probability 1/2. So regardless of the case, with probability 1/2 at least one 
of letters in abc is compressed. There are U^^OI/^J three-letter segments. The expected number of 
segments in which at least one letter is compressed is thus at least LI , S'(^ r )l/6ji so for some partition 
at least U^kOI/Sj letters are compressed. 

Concerning the second property, observe, that the analysis applies in the same way, consider any 
explicit word w' between two variables in U or V (or the explicit word beginning or ending U or 
V). Then the analysis is the same, except that the number of segments of w' is at least U^'|/3j < 
|u/|/3 — 2/3. Since there are at most n v + 1 such words and in U and V there are together at most 
n v variables, there are (\U\ + \V\ — |n„ — |)/3 such segments. The same expected- value argument 
yields that at least (|f| + |V| — §n„ — |)/6 letters are compressed, note that the appropriate partition 
is guessed as the second partition of Letters to Lettersi and Letters2- This shows the claim. □ 

□ 

Lemma 10. Let N be the size of the length-minimal solution. Then for appropriate nondeterministic 
choices WordEqSat accepts after C(logiV) phases. 

Proof. The proof follows from Claim [TJ □ 

Lemma 11. WordEqSat nondeterministically verifies the satisfiability of a word equation. 

Proof. Firstly, observe that if |J7| = \V\ = 1 then the satisfiability of word equation is trivial to verify, 
which is done in last line of WordEqSat. 

Note that as WordEqSat is a composition of subprocedures preserving satisfiability and unsatisfibil- 
ity, it also preserves them. So if the equation is unsatisfiable, 'NO' is never returned, while if 'YES' is 
returned, the original equation is satisfiable. Finally, Lemma [10] shows that for a satisfiable solution, 
i.e. a one that has a length-minimal solution of length N for some N, WordEqSat accepts the equation 
(for appropriate nondeterministic choices). □ 

4. Maximal blocks 

The quadratic memory consumption of WordEqSat is due to BlockComp. Since we aim at O(nlogn) 
memory consumption, we need to improve it. To this end we analyse more carefully the structure and 
possible lengths of maximal blocks. This analysis allows a different approach to blocks compression: 
instead of guessing the explicit values of a- prefixes and 6-sufnxes of variables, we parametrise those 
values and check which of them are the allowed. To be more precise, the lengths of maximal blocks are 
expressed in terms of lengths of a-prefixes and 6-sufnxes and blocks of the same lengths are identified 
(using non-deterministic guesses) and replaced by the same letter. The verification of feasibility of the 
guesses is done by comparing the lentghs and verifying the resulting system of Diophantine equations. 
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Finally, it turns that it is enough to check such a system, the actual lengths are not important; due 
to special form of this system, it can be checked, whether it has a solution, in linear space. 



Arithmetic expressions. Consider arithmetic expressions using natural constants and variables, 
such that all expressions are linear in these variables. These expression are obtained from a word 
equation U = V in a way described in the following subsection. We say that a set of ei, e2, . . . , e m is 
a small set of linear Diophantine expressions (for a word equation U = V) if 

• each variable in the expressions is either xx or yx, where X is a variable from U = V 

• if X appears k times in U = V then xx (yx) appears at most k times in expressions 

• the coefficients and constants in each expression are positive 

• the sum of absolute values of constants in {ej}£L x is at most \U\ + \V\ — n v . 

We say that a system of linear Diophantine equations and inequalities (all inequalities are of the 
form x > 1) is a small linear Diophantine system, if sides of its equalities form a small set of linear 
Diophantine expressions and each e, in this set is used at most two times in this system. As a simple 
consequence small system of linear Diophantine equations and inequalities has the following properties: 

• each variable in the system is either xx or yx, where X is a variable from U = V 

• if X appears k times in U = V then xx (yx) has some of absolute values of its coefficients at 
most 2k 

• the sum of absolute values of constants is at most 2(\U\ + |V|), (2(|J7| + |V| — n v ) comes from 
the equalities while 2n v from the right-hand sides of inequalities) . 

Lemma 12. If the equation U = V is represented using b bits, then the corresponding small linear 
Diophantine system can be encoded using 0(b) bits, moreover, it can be (nondeterministically) checked 
in the same space, whether it has a natural solution. 

Proof. We encode the equalities in unary: i.e. each constant c is represented as 1 + 1 + • • • + 1, while 

c times 

each ex is represented as x + x + ■ • • + x . The variables xx , yx are encoded in the same way as X in 

c times 

U = V, with additional bit to distinguish them. The assumptions on the small system guarantee that 

• the total space used by constants is 2(|f7| + |V| — n v ), which is at most 26 

• the space used by variables is at most 8 times as much as the space used by variables in U = V: 
a denotation of a variable xx (yx) is at most twice as long as the variable X and it appears 
at most 2 times more in the small linear Diophantine system as X in U = V . 

The additional space needed to denote '+' and '=' may increase the space usage only by a constant 
(note that we might need to change the denotation of other symbols a bit). 

The idea of the verification is that instead of guessing the whole solutions, we guess only the last 
bits (i.e. the parity of integer variables), verify this guess, simplify the equation and proceed: for each 
variable x we guess, whether it is pair or odd and appropriately replace it with 2x or 2x + 1. Then 
we verify the guess by checking whether both sides of each equality have the same parity. If so, we 
divide each side of the inequality by 2 (rounding down) and proceed in the same fashion. Note, that 
in this way, the coefficients at each variable remain constant. 

There is a little comment concerning the inequalities: all of them were initially x > 1, and during 
the algorithms they can be also of the form x > 0. When rounding down, we need to take care that 
rounding is done in an appropriate way, for instance 2x > 1 is in fact 2x > 2 and so after dividing 
and rounding we should end up with x > 1 again. Observe that this boils down to replacing x > 1 by 
x > if and only if x is replaced by 2x + 1, otherwise, the inequality remains as it were. 
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Algorithm 8 VerifyDiophantine Checks the satisfiability of a small linear Diophantine system 

1: while there is a non-zero constant or inequality x > 1 do 

2: for each variable x do 

3: guess b x G {0, 1} 

4: replace each x with 2x + b e 

5: if there is an equation with different parity of constants on the sides then 

6: return Unsatisfiable 

7: divide each equation by 2, rounding down 

8: divide each inequation by 2, round appropriately 

9: return Satisfiable > Has a trivial (0, . . . , 0) 



Suppose that the small linear system has a solution (q±,q2, ■ ■ ■ , Qr)- We show that for some nonde- 
terministic guesses, the obtained system has a solution (|_</i/2j, LQ2/2J, . . . , [q r /2\). Let the algorithm 
guesses the parity of x according to (q±, 02 > • • • , q r )- Then after the loop in line El the obtained system 
has a solution ( [_Qi/2j , L92/2J , . . . , [q r /2\ )• Since each coefficient by the variable is even, the constants 
at side of each equation should be of the same parity, and so the algorithm does not terminate in 
line Line [7] halves each equation. Observe, that an inequality 2x > is equivalent to x > and 
inequalities 2x > 1 in fact meant that 2x > 2 and so they are also simply halved. 

On the other hand, if (qi, q2, . . . , q r ) is a, solution after the changes, then (2gi + &i, 2q2 + &2i • • • , 2q r + 
b r ) was the solution of the system at the beginning of the iteration. 

Concerning the space usage, the inequalities are simply stored as a bit for each variable (bit set 
to i means that x > i). When the starting systems has size b, the intermediate ones have size 0(b): 
observe that the only new constants (which also stored in unary) are the Is from 2x + 1. When the 
sum of coefficients by variables is at most m, they may introduce up to m bits in one round. However, 
those bits are also halved in this round, so the memory consumption increases by m with regards to 
the starting system, so it is only constant time larger. □ 

Constructing a small Diophantine system from a word equation. For a solution S of U = V 
we define a (small) linear Diophantine system, such that each of its solutions will correspond to some 
solution of U = V. Different solutions S' of U = V defined in this way shall differ from S in lengths 
of maximal blocks in S'(U). We say that two solution S and S' are similar, if for every variable X the 
S(X) = E1E2 ■ ■ ■ Ej and S'(X) = E[E' 2 ■ ■ ■ EL where for each i the Ei and E[ are blocks of the same 
letter, usually of different lengths, although always at least 1. Consider now a similar representation of 
S(U) and S'(U') as concatenation of maximal blocks E\, E2, ■ ■ ■ , E^, and E[, E 2 , ■ ■ ■ , E' k , respectively. 
Since S and S' are similar, E^ is a crossing block in S if and only if E' k is in S'. For a fixed S, we are 
going to characterise the possible lengths of E[ using lengths £7j. 

The Lemma [2] suggests that the crucial to consider, when dealing with maximal blocks, are the 
lengths of the a prefixes and 6-suffixes of S{X) for different variables X. For every variable X, let 
xx denote the length of the maximal a-prefix of S'(X) and yx the length of the maximal 6-suffix of 
S'(X), where a and b are the first and last letter of S(X). In the latter, we treat those values as 
variables rather than as numbers. 

Consider e, = \E[\, let E[ be a block of as: 

• E[ may begin with either an explicit a or a maximal a-suffix of some S'(X) (which may be 
whole S'(X)); 

• 'in the middle' it may contain either explicit a's or S'(X) £ a + ; 

• it ends with an explicit a or a maximal a-prefix of some S'(X) (which may be whole S'(X)). 

Thus, whenever E\ is an explicit or a crossing block, e, is a linear combination of xx, yx, where 
X £ X, and natural numbers. 

We now write a (small) linear Diophantine system that bounds the ei, . . . , e& together. To this end 
we use S, of which we know that it is a satisfying solution of U = V and is equal to S' for xx = ix 
and yx = Tx- Recall that we say that a block of letters a 1 has a visible length (with respect to S), if 
a is a crossing block, an explicit block or it is a prefix or suffix of some S(X). Firstly we remove e, 
such that Ei does not have a visible length. Then for fixed p and a, where S contains an a's p block, 
consider the expressions E^, Ei 2 , Ei k corresponding to the visible p-blocks of a in S(U) and S(V). 
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We write the i m -\ equations equalising their lengths: 

we do so for every possible pair a, p. Thus we have obtained a linear Diophantine system in xx 
and yx- We add the inequalities xx > 1 (and yx > 1) for every variable used in the equalities. If 
some block variable X is a block of letters, we use only xx in the equations, yx is not used in the 
equalities. We claim that the constructed expressions form a small set of Diophantine expressions and 
consequently, the obtained system is a small linear Diophantine system; furthermore, each of solutions 
of this system corresponds to solution of the equation U = V. We first show the former. 

Lemma 13. The constructed Diophantine expressions form a small set, the constructed linear Dio- 
phantine system is small. 

Proof. We first show that the terms e±, . . . , e& are a small set of linear Diophantine expressions. For 
the purpose of the proof, denote by nx the number of times variable X is used in the equation U = V . 
We bound the number of times xx and yx appear in expressions e\, . . . , e^ and the size of additive 
constants used in ei, . . . , e&. 

• each xx (yx) appears at most nx times, as for a fixed appearance of variable X there is at 
most one maximal block E[ that spans over the prefix (suffix, respectively) of this appearance; 

• the total size of used constants is \U\ + \V\ — n v : for a fixed explicit appearance of a letter a, 
there are is exactly one maximal block E[ that spans over it. □ 

Lemma 14. Each solution {lx,i"x}x£X of @ induces a solution S' of the equation U = V. All 
such solutions S' are similar. 

Proof. We begin with showing the first claim of the lemma that each solution {lx, r x}xex of ([2D 
induces a solution S' of the equation U = V, which is similar to S. The proof will be a straightforward 
consequence of the definition of lx , rx and system © . 

Consider a variable X and the decomposition E\E 2 . . . Ei of S(X). Since we aim at having a similar 
solution S' , S'(X) is defined as E'^E^ ■ ■ ■ E[, where Ei and E[ are blocks of the same letter. It is left 
to define the lengths of E[, E' 2 , . . . , E[. 

Consider any solution lx and rx of ([2D for variables xx and yx, where X € X. Take a variable 
X, let S(X) begins with a and end with b, then S'(X) has a-prefix (6-suffix) of length lx {fx, 
respectively); if S(X) G a + then S(X) = a x . This defines lengths of some visible blocks in S'(U). 
Consider other block E[. If the corresponding Ei is not visible, we let E[ have the same length as Ei. 
If Ei has a visible length in S(U) then let j be such that Ej and Ei have the same length and Ej is 
visible, i.e. the length of E'a is already known. Thus we assign to E[ the same length as to E'a. 

In this way the E[ block has either the same length as Ei (when Ei does not have a visible length) 
or length e« with each xx substituted with lx and yx with rx, when Ei has a visible length. We 
claim that if Ei and Ej have the same length, also E[ and E'a have the same length: indeed, if Ei 
and Ej do not have visible lengths, then E[ has the same length as Ei and E'a as Ej, which shows the 
claim. If Ei and Ej have visible length, then ej and ej are equalised in ([2D and as {lx,fx}x&x is a 
solution, E[ and E'a have the same lengths This observation together with the fact that S' is similar 
to 5 and trivial induction shows that S'(U) = S'(V). □ 

Improving BlockCompNCr. The BlockCompImp works similarly as BlockCompNCr, but it does not 
guesses the lengths of the a-prefix and 6-suffix of S(X), instead it treats them as parameters (or 
variables) xx and yx, i-e. it replaces X with a Xx Xh yx . Then it identifies the maximal explicit blocks 
E\ , E 2 , ... of the same length (non-deterministically) , the visible lengths e\ , e 2 , ... of these blocks are 
expressed using constants and parameters {lx,rx}x<=x- Then the guess is verified by checking the 
feasibility of a system of linear Diophantine equations: if Ei and Ej are guessed to have the same 
length, then ei = ej. Lastly, we add the conditions xx > 1 and yx > 1 for the variables. If the system 
if feasible, then we replace blocks of the same letter and the same length by a fresh letter. 

Lemma 15 (cf. LemmaED- BlockCompImp preserves unsatisfiability. 

For a solution S of U = V, for appropriate nondeterministic choices it preserves the satisfiability 
and implements the block compression (of all letters a). 

It uses a constant time more memory than the equation U = V . 

Using BlockCompImp in WordEqSat instead of BlockComp reduces the space usage of WordEqSat 
to 0(n\ogn). 



RECOMPRESSION: A SIMPLE AND POWERFUL TECHNIQUE FOR WORD EQUATIONS 15 



Algorithm 9 BlockCompImp 



1 
2 
3 

4 

5 
6 

7 
8 

9 
10 

11 
12 
13 
14 
15 



run CutPrefSuff, in which all xx and yx are integer variables > there are no crossing blocks 

let £ = {Ei, . . . , Efc} be the explicit maximal blocks > Compression of a blocks starts 

for each Ei do 

let ei •*— \Ei\ > Given in terms of constants and parameters xx and yx 

partition £ into £\, £2, ■ ■ ■ , £ m > each £, has only as blocks for some a > Guess 

for each £i = {E^ ,■■■ , Ei k } do > Construction of the linear Diophantine system D 

for each Ei- € £i do 

add an equation e^ = e^ +1 to D > Ignore the meaningless last equation 

for X £ X do 

add inequalities xx > 1 and yx > 1 to D 
run VerifyDiophantine on D 

for each £j = {E^ , . . . , ^i fe } do 
let a e . G r be an unused letter 



l e, 



for each £7j . € <Si do 

replace every E^. by a e - 



Proof. Suppose that BlockCompImp transformed U = V and created a linear Diophantine system D 
that has a solution {lx,rx}xex- Then we can think of BlockCompImp as if it replaced each X with 
a^ x Xb rx and then replaced some blocks of the same letter and the same length with fresh letters. 
Thus by Lemma [3] it preserves unsatisfibility. 

In the other direction, suppose that U = V has a solution S. Let lx and rx be the (non-zero) lengths 
ax-prefix and ox-suffix of S{X) for each variable X. We want to show that with appropriate choices, 
i.e. xx = lx and y x = rx, the run of BlockCompImp results in the same instance as BlockComp. 

Observe that BlockComp first replaces X with a x Xb' x and then it replaces explicit blocks of 
the same letter. Let e' l5 e' 2 , ...be the lengths of the consecutive maximal blocks in U = V (after 
CutPrefSuff). Let BlockCompImp guess in the following way: it partitions the maximal blocks Ei. E2, 
. . . according to S, i.e. Ei and Ej are in the same partition class if they are blocks of the same letter 
and e'i = e'-. Observe that {lx,rx}xex is a solution of the corresponding system, furthermore, e\ 
is equal to e, in which for xx substituted lx and for yx substituted rx- Hence, the system D is 
satisfiable. Then BlockCompImp replaces blocks which corresponds with the replacement of blocks by 
BlockComp (up to renaming letters). Hence, BlockCompImp also implements the blocks compression 
(of all letters a). 

Concerning the memory consumption, observe that by Lemma fT21 the linear Diophantine system 
can be encoded using only a constant more bits than the word equation. All other operations can be 
easily implemented in the same memory bounds. □ 

5. Linear space for 0(1) variables 

As already shown, the length of the word equation kept by WordEqSat is linear. However, the 
letters in this equation can be all different, even if the input equation is over two letters. Hence the 
space usage is (nondeterministic) O(nlogn) bits. We would like to improve the space consumption 
to linear; to be more precise, we would like the space consumption to be 0{m) bits, where the input 
equation used m bits in a natural encoding. We fail in a general case, such a bound is shown only for 
0(1) variables (although it holds for arbitrary many appearances of these variables in the equation, 
i.e. n v is not bounded and the alphabet size is arbitrary). The proofs given in this section work 
assuming that each appearance of a letter (variable) in the input is always given using the same bit 
representation, however, it is not assumed that all letters and/or variables have the representations of 
the same length, i.e. the presented method works also when the input equation is compressed using 
Huffman coding. 

The main obstacle is the encoding of letters introduced by WordEqSat. We show that when we 
look at the computation of WordEqSat that do not remove the variables from the equation, the 
space consumption can be limited to 0(n). Then for k = 0(1) variables we can consider k stages 
of WordEqSat, a stage ends when a variable is removed from the equation. In this way the space 
consumption will be estimated by c k n bits, which is linear for constant k. 
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It is left to describe a suitable encoding of letters. Consider string of explicit letters between two 
consecutive variables X and Y in U = V, together with the variables. During WordEqSat the XwY 
will be changed to Xw^'Y, Xw^ 2 'Y, .... Observe, that each w^ l > can be partitioned into 3 substrings 
xW^WyW, where the letters in yW represent solely the letters from w, while each letter in x^ l > {y^ l >) 
represent also some letter popped at some point from X (Y, respectively). Note, that it is easy to 
encode jjW using only a constant time more bits than w: roughly, we represent letters as trees and 
when merging a and b into c, the tree of c is has the tree of a as a left subtree and a tree of b as a 
right subtree; using any usual encoding the size of such representation is only constant times larger 
than the original text in. 

On the other hand, the letters in x^' and j/W depend solely on XwY , so we simply encode them 
as (XwY)l, [XwY)2, . . . , (XwY)(\x^'\ + \y^'\), where '(XwY)' is encoded exactly as it was in the 
input equation while the following numbers are encoded in binary. Note that the same code i (XwY)V 
is (usually) used in each phase, but it denotes different letters in the respective phases. 

Observe, that in this way different appearances of the same letter a may get the same code: in such 
case we collect the codes for a and add the information that they all represent the same letter. 

However, this approach raises a new concern: it might be that the length \x^' \ + \y^' | is non-constant: 
WordEqSat only guarantees that the length of the whole \U\ + |V| is 0(n), but some fragments may 
become large. However, for 0(1) variables this can be solved easily, as we can enforce that in one phase 
each pair of consecutive letters is compressed: firstly, a simple preprocessing (to be precise, Pop(r, T)) 
ensures that there are only 0(k) crossing pairs, where k is a number of variables. Then non-crossing 
pairs are compressed separately (not causing any increase of size of the kept word equations) and each 
of the crossing pair ab is compressed using PairComp(a, b). Then each substring of S(U) is compressed 
and the size of kept equation remains linear. 

Algorithm 10 LinWordEqSat Checking the satisfiability of a word equation in linear space for 0(1) 
variables 

l 
2 
3 
4 
5 
6 
7 

8 
9 

10 



while \U\ > 1 or \V\ > 1 do 

BlockCompImp > Block compression 

Pop(r, r) > The number of crossing pairs is 0(k) 

P 4— list of non-crossing pairs > Guess 

P' <— list of crossing pairs > Guess 

for ab G P do 

run PairCompNCr(a, b) 
for ab € P do > P' = 0{k) 

PairComp({a}, {&}) 

Solve the problem naively > With sides of length 1, the problem is trivial 



The properties of LinWordEqSat are summarised in the below theorem. 

Theorem 2. LinWordEqSat preserves unsatisfiability and satisfiability. For k variables, it runs in 
(nondeterminsitic) 0(mk ck ) bits, for some constant c, where m is the space consumption (measured 
in bits) of the input word equation. 

For the input equation U = V define consecutive stages, a stage ends immediately when one variable 
is removed from the kept equation. Then the next stage starts instantly afterwards. 

We begin with showing the correctness of LinWordEqSat, the proof is a slight modification of the 
proof of correctness of WordEqSat, see Lemma [TT] 

Lemma 16. LinWordEqSat preserves unsatisfiability and satisfiability. The kept equation has length 
0(kn) in one stage, where equation at the beginning of the stage has length n. 

Proof. All subprocedures in LinWordEqSat are known to preserve satisfiability and unsatisfiability, note 
that a proper guess of noncrossing pairs in lineH]is needed, as PairCompNCr(a, b) preserves satisfiability 
only for a noncrossing pair ab. Observe that if a'b' is another noncrossing pair to be compressed, then 
after PairCompNCr(a, b) the pair a'b' is still noncrossing. So also LinWordEqSat preserves satisfiability 
and unsatisfiability. 

Concerning the space consumption: since we try to compress each crossing pair, a stronger version 
of Claim [T] can be shown: 
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Claim 2 (cf. Claim [I]). Let U = V has a solution S. For appropriate choices, the equation U' = V 
obtained at the end one stage of LinWordEqSat has a solution S' such that 

• for each pair of two consecutive letters in U or V, one of these letters is compressed in U' or 

• for each pair of two consecutive letters in S(U), one of these letters is compressed in S'(U'). 

Proof. Consider any two consecutive letters ab. If a = b then they are compressed by BlockComplmp. 
If they are not and one of the letters is compressed in BlockComplmp then we are done. Otherwise, 
ab will be either in P or in P' and we try to compress it. We fail only if one of those two letters was 
already compressed. □. 

To show the bound on the length of the kept equation, we first estimate that the number of crossing 
pairs is indeed 0(k). Observe that after Pop(T, T) in lineEJeach occurrence of a variable X is preceded 
(succeeded) by the same letter, say ax (bx, respectively). When b (a) is the first (last, respectively) 
letter of S(X), the X brings only two crossing pairs axb and bx&- As there are k different variables, 
there are at most 2k different crossing pairs. 

Using a similar argument as in Lemma EJ it can be shown that the length of the kept equation is 
0(nk), as Pop is run k + 1 times in one stage and in each phase it introduces at most 0(n) letters. □ 

We distinguish two types of appearances of an explicit letters in U = V in one stage: inner and outer 
appearances; note, that the same letter a may have at the same time an inner and an outer appearance. 
Each explicit letter at the beginning of the stage is inner, each letter popped from a variable is outer. 
When we compress two (or more) inner letters, the result is an inner letter; otherwise the merged 
letter is outer. Observe that this implies that each substring w between two variables, say X and Y, 
can be partitioned into w = xvy, where x, y consist solely of outer letters and v consists solely of inner 
letters. LinWordEqSat guarantees that \x\ and \y\ are linear in k: 

Lemma 17. In one stage, at the beginning of the phase, the maximal substring of outer letters has 
length O(k). 

Proof. As there are k + 1 application of Pop, the length of such block increases by at most Ik + 2 
(it may be expanded from both ends in one phase). On the other hand, by Claim [5] each substring 
of length 4 is replaced by a substring of length 3 or less in one phase of LinWordEqSat. This applies 
to the substrings of outer letters, and similarly as in ([T]) it can be shown that these substrings have 
length at most O(k). □ 

The inner letters are encoded as follows: when compressing two (or more) inner letters represented 
as a and b we represent them as (a,b), where '(', ',' and ')' are some appropriately coded symbols; 
we can think of this as a flattened tree. Note, that in this way when a string of input symbols is 
compressed into string w', then w' uses only constant time more bits than w. 

The outer letters are encoded in a different way: note that if XwY has two different appearances 
in U = V then in both of them the outer letters (and inner ones) will be equal in one stage, and so 
can be encoded using the same symbols. 

Lemma 18. Let XwY has two different appearances in U = V at the beginning of the stage. Then 
in this stage both those appearances are represented using the same strings. 

Proof. Observe that the letters popped from a variable depend only on the variable and not on the 
surrounding letters. Then the string w between those two variables is transformed exactly in the same 
way in both appearances. □ 

By Lemma [T7] and [TS1 outer letters that appear in the string representing XwY can be encoded as 
(XwY)j^(number of letter), where (XwY) is encoded as in was in the equation at the beginning of 
the stage and the following number of letter is encoded in binary. By Lemma [T71 only 0(k) number of 
different letters per XwY is encoded in this way, and each appearance of such a letter can be charged 
to an appearance of XwY at the beginning of the stage, so the space consumption can be bounded 
as a constant times the original consumption. 

Observe that the same letter may be encoded in several different ways, this is not a problem, we 
separately keep a list of different representations of the same letter. Note that this increases the space 
consumption by a constant. 
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Lemma 19. The space consumption in one stage of LinWordEqSat is at most 0(km), where m is the 
size of the representation of the equation at the beginning of the stage. 

Proof. This follows from the discussion above. □ 

proof of Theorem^ Since the number of different variables is k, there are at most k stages. Note that 
during one stage the space consumption increases at most ck times, where c does not depend on k, 
nor n, see Lemma [191 Thus, the total space consumption is at most (kc) times greater than the one 
of the input equation. 

The correctness follows from Lemma [TB] □ 



6. Solutions other than length minimal 

In the next section we are going to give algorithm generating a representation of all solutions of 
a word equation. However, so far we have considered mainly the length minimal solutions, and clearly 
there are other ones. In this section we recall the classification of solutions, taken from work of 
Plandowski [20j . The main result of this classification is the identification of the minimal solutions, 
which have all properties of the length-minimal solutions that we use, except the exponential bound 
on the exponent of periodicity. The solutions (substitutions) are classified not by their length, which 
depends on the actual equation, instead it looks on whether one can be obtained from the other using 
homomorphisms. 

We first extend the notion of the solution, so that it can include letters that do not appear in the 
equation: By T' we denote the letters that can appear in the solution, even though they do not appear 
in the equation; formally V is an arbitrary set such that I"" n T = (and of course IT" D X = 0). Then 
substitution is a morphism S : X U T — > (T U I"")* that satisfies the previous assumption that S(a) = a 
for every a E T; a notion of the solution generalises to this setting. We call r" free letters of the 
solution. 

We use the name operator to denote functions transforming substitutions. A special class of oper- 
ators is particularly important for us: given a morphism <j) : Y U T' i— > (r U r')* by $ (so capitalised 4>) 
we denote a corresponding morphism that acts on substitutions, changing S(X) by <fi, to be precise 
<5[5](X) = (f>(S(X)) and $[5](a) = a for a € T U I"". For composition of operators we use the usual 
symbol o, however, when indexed composition is used, we denote it by Yl, for lack of a better symbol. 

Definition 2 (cf. |2Uj). A solution S : X U T — > (r U T')* of an equation U = V is a unifier (with free 
letters V), when S(U) contains at least one letter from V. S' is an instance of a unifier solution S, if 
S' = &[S] for some non-erasing non-permutatingj morphism <ft : (T U T') i— > (r U r') + that is constant 
on r. A solution S is minimal, if it is not a unifier solution, nor an instance of a unifier solution. A 
unifier solution is minimal, if it is not an instance of a unifier solution. 

The assumption that the instance of a unifier solution is obtained by a non-erasing morphism is 
technical, but it ensures easier and cleaner classification of minimal solutions, for instance, each length- 
minimal solution is also minimal. It is easy to observe that as I"" n T = 0, every instance S' of a unifier 
solution S is a solution (perhaps a unifier one). 

Example 2. Consider an equation aXb = Y. Then each S(X) = w and S(Y) = awb is a solution. If 
w = e then S is a length-minimal solution, it is also a unique minimal solution; when w contains a free 
letter, then S is a unifier solution, when additionally aiGf' then this is a minimal unifier solution. 

Consider an equation aX = Xa, then each S(X) = a n is a minimal solution, S(X) = e is a 
length-minimal one; there are no unifier solutions. 

Consider an equation aXYX 3 = XYaY 2 . Since S(aXY) and S(XYa) have always the same 
length, this is equivalent to a system of equations aXY = XYa and A 3 = Y 2 . The former has 
solutions X = a n ,Y = a m and the latter ensures that 3n = 2m. All such solutions are minimal and 
S(X) = S(Y) = e is length-minimal. There are no other solutions, in particular, no unifier solutions. 



A morphism <fi is non-erasing if 4>(a) ^ e for every letter a and it is non-permutating if <j) is not a permutation on its 
domain. 
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Typical operators. While in the definition of minimal solutions the operator $ corresponding to a 
morphism <f> is arbitrary, in the proofs we usually see morphisms that are related to pair compression 
and blocks compression. By h c ^. ab denote the morphism which replaces c by ab and is constant on all 
other letters, the h~. b is the corresponding inverse morphism (note, that when a ^ b the inverse is 
well-defined); by b a denote the morphism which, for each £, replaces ai by a 1 , since a block of a can 
has various partition into subblocks of as, 6" 1 is not well defined. For the purpose of this paper, we 
specify its action as follows: b~ x replaces each a's maximal ^-block by a letter a?. The H c _> ab and B a 
denote the corresponding operators, H~_^ ab the inverse operator and B~ l the inverse mapping. 

Properties of minimal solutions. As already noted, the minimal solutions inherit most of the 
crucial properties of length-minimal solutions. In particular, a variant of Lemma [5] holds for them. 

Lemma 20 (cf. [221 Lemma 6], cf. Lemma [2]). Let S be a minimal solution of U = V . 

• If ab is a substring of S(U), where a^b, then ab is an explicit pair or a crossing pair. 

• If a k is a maximal block in S{U) then a has an explicit appearance in U or V and there is a 
visible appearance of a . 

Proof The first claim, which regards a pair ab, is shown using the following fact (we do not assume 
that S is minimal, as we reuse Claim later on in this more general setting) : 

Claim 3. If ab is not an explicit nor crossing pair for a solution S for U = V, then S' = H~ b [S] 
for a free letter c G V is a unifier solution of U = V. In particular, S = H c ^ ab [S'] is an instance of S' 
and so it is not minimal. 

Proof. Consider S' = H~_^ ab [S]. Since ab is not an explicit nor a crossing pair, each appearance of ab 
in S(U) (and S(V)) comes from S(X) for some variable X. Thus S'(U) is obtained from S(U) be 
replacing each ab by c, which can be used to show that S' is a solution: 

S'(U) = (H^JS])(U) = h- c \ ab {S{U)) = K\ ab (S(V)) = (H-_lJS])(V) = S'(V). 

Since c is a free letter, this means that S' is a unifier solution. Observe also, that as c does not 
appear in S(X) for any X, then (H c -+ a -b o H~_^ ab )[S] = S: indeed, the H~_^ ab replaces each ab by c in 
every S'(X), while H c ^ ab replaces each c by ab in every S(X). Hence, S = H c ^ ab [S'\ and as h c -^ a b 
is non-erasing, non-permutating and constant on T, we conclude that S is an instance of S', which 
contradicts the assumption that it is minimal. □ 

Consider now the second claim, which regards the maximal blocks of a. Observe that if a appears 
in S(U) and it does not appear in U, nor V, then it is a letter from T' and so, by definition, S is a 
unifier solution and thus cannot be a minimal solution, hence a appears in U or in V. 

To streamline the presentation and analysis, in the remainder of the proof assume that both U 
and V begin and end with a letter and not a variable; this is easy to achieve by prepending $ and 
appending $' to both sides of the equation. Alternatively, the cases with variables beginning or ending 
U or V can be handled in the same way, as the general case. 

Consider a maximal a block a , for i > in S(U) and the letter preceding (succeeding) it, say b 
and c, respectively; by the assumption that U and V begin and end with a letter, such b and c always 
exist. Consider the appearances of ba c in S(U) and S(V). Since b ^ a ^ c, these appearances cannot 
have overlapping a's (though, if b = c, these letters can overlap for different appearances). Suppose 
that none of these appearances is crossing or explicit. Then for each of such appearances there is a 
variable Y such that ba c is wholly contained within some appearance of S(Y). Change the solution S 
into 5', by replacing each ba l c in each S(Y) by bvc for a free letter v; since a 1 in various appearances 
of ba c do not overlap, such replacement is well-defined. Then S' is still a solution. In fact, it is a 
unifier solution. Furthermore, S is its instance, contradiction. Hence, there is an explicit or crossing 
appearance (with respect to S) of bare. Then this appearance restricted to a satisfies the claim of 
the lemma. □ 

Minimal unifier solutions. It is already known from the work of Plandowski [20], Lemma 1] that 
finding minimal unifier solutions reduces to finding minimal solutions. This is a consequence of the 
following lemma, which is strengthening of Lemma 1201 to minimal unifier solutions. 

Lemma 21 (cf. [2DT Lemma 1]). If S is a minimal unifier solution with a free letter v € T' , then for 
some variables X and Y it holds that v is the first letter of S(X) and v is the last letter of some S(Y). 
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Proof. The proof is similar to the proof of Lemma I2U1 Suppose that v is not a last letter for any S(X). 
Consider any appearance of v in S(U), and let a be some letter directly to the right of one of v's 
appearances. The pair va is non-crossing for S and S = H^^ va [S'] for some fresh letter b and a unifier 
solution S', by Claim [31 To conclude that S is not minimal, it is left to show that h^^ va non-erasing 
(obvious), non-permutating (also true, as h},^ va (b) = v a £ T U I"") and constant on T (true, as b £ T). 
Symmetric argument can be given, when v is not a last first of some S(X). □ 

Intuitively Lemma ED yields that search for minimal unifier solutions reduces to looking for minimal 
solutions: it is enough to 'left-pop' a letter from each variable, in this way the free letters are introduced 
into the equation and become standard letters. With appropriate nondeterministic guesses, so unifier 
solutions of U = V will correspond to non-unifier solutions of U' = V'. The precise statement needs 
some additional definitions, which are introduced in the next section; thus the formal statement is 
deferred to the following section, see Lemma | 



7. Representation of all solutions 

The first PSPACE algorithm for verifying the satisfiability of word equations, PlaSat, was extended 
by its author to PlaSolve, which returns a finite, graph-like, representation of all finite solutions of a 
given word equation |20j . This extension is done in two stages: firstly the original PlaSat is modified 
into another algorithm PlaSatlmp, which also only checks satisfiability of word equations; then PlaSolve 
uses PlaSatlmp as a subprocedure in generation of a graph representation of all finite solutions of a 
word equation. The modification into PlaSatlmp is nontrivial, and its correctness required a separate, 
involved proof. In this section we show that WordEqSat that uses BlockComplrnp instead of BlockComp 
also can be used to generate a (similar) representation of all solutions of a given word equation. 

Representing all solutions. We want to use WordEqSat, which still only verifies satisfiability, as a 
subprocedure for an algorithm generating a (finite) description of all finite solutions. The approach 
is simple and in fact is similar to the earlier approach used by Plandowski [2D]: the representation is 
modelled by a graph, with nodes labelled with equations U = V that are considered by WordEqSat and 
edges representing transformation performed by WordEqSat. To be precise, if an equation U = V is 
transformed into U' = V by WordEqSat (for some nondeterministic choices) we put an edge between 
nodes labelled by these two equations and label it with an operator that transforms solutions of 
U' = V into solutions of U = V; furthermore, each solution of U = V can be represented in this way. 
Also, nodes with trivial equations (i.e. \U\ = \V\ = 1) have only one, easy to define, minimal solution 
(or no solution at all). Since WordEqSat runs in PSPACE, such generation of labelled vertices and 
edges can also be performed in PSPACE. 

Theorem 3 (cf. [20]). The graph representation of all minimal solutions and minimal unifier solutions 
of an equation U = V can be constructed in PSPACE. The size of the constructed graph is at most 
exponential. 

As already noted, representing all minimal unifier solution can be reduced to representing all min- 
imal solutions, which will be formally stated in Lemma [23 Thus, in the following, we focus on 
the representation of minimal solutions of a given word equation. We begin with the description of 
operators that are used to transform the solutions. 

Transformign solutions and inverse operators. So far we only know that WordEqSat preserves satisfi- 
ability and unsatisfiability, however, we do not really know what happens with particular solutions: 
we do not know how to obtain the solution of the original equation U = V from the transformed 
equation U' = V , even worse it might be that many of them are somehow lost in the translation. We 
strengthen the notion of preserving the satisfiability (and unsatisfiability). 

Given a (nondeterministic) procedure transforming the equation U = V we say that this procedure 
transforms the solutions, if based on the nondeterministic choices and the input equation we can define 
a family of operators "H such that 

• for any solution S of U = V there are some nondeterministic choices that lead to an equation 
U' = V' such that S = H[S'] for some solution S' of the equation U' = V and some operator 

Hen-, 

• for every equation U' = V that can be obtained from U = V and any its solution S' and for 
every operator H € % the H[S'] is a solution of U = V. 
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We also say that the equation U = V with its solution S are transformed into U' = V with S' and 
that 7i is the corresponding family of inverse operators. In many cases, % consists of a single operator 
H, in such case we call it the corresponding inverse operator, furthermore, in some cases H does not 
depend on U = V , nor on the nondeterministic choices. 

If for minimal solution S the nondeterministic procedure transforming U = V with S into U' = V 
with S' , where S' is also minimal regardless of the nondeterministic choices, we say that this procedure 
preserves minimality of solutions. 

Our main goal is to show that subprocedures of Word Eq Sat transform the solutions, give the ap- 
propriate family of operators and show that they all preserve minimality. 

Operators appearing in WordEqSat. We describe the family of inverse operators corresponding to 
various subprocedures of WordEqSat. Define an operator Prepend^, x for a string w S (TUT')*, which 
prepends w to solution for X and leaves other variables untouched, formally: 

Prepend wX [S](X) = wS(X) and Prepend w>x [S](Y) = S(Y), for Y ^ X. 

Define Append^ x similarly, by appending w to S(X). 

Lemma 22 (cf. Lemma[S]). Pop(ri,r2) transforms and preserves minimality of the solutions. 

Suppose it left-popped b x £ T r U {e} and right-popped ax € 1^ U {e} from X , then the corresponding 
family of inverse operators consists of only one operator: 

H = ]J A PP end a x ,x ° Pre Pend fejf)X . 

xex 

Proof. Fix the nondeterministic choices and let H be as defined in the lemma statement. Observe 
that from the proof of Lemma it follows that if S is a solution of U = V then U' = V has a solution 
S' such that S = H[S']. On the other hand, when S' is a solution of U' = V then S = H[S'] satisfies 
S(U) = S'(U') and similarly S(V) = S'(V), which makes S a solution of U = V. 

It is left to show that if U = V with minimal solution S is transformed to U' = V' with S' then 
also 5" is a minimal solution. Suppose that S' is not minimal, i.e. it is either a unifier solution or an 
instance of a unifier solution. 

it is a unifier solution: Then S' has a free letter. As H only prepends and appends letters, 
also S = H[S'] has a free letter, which makes it a unifier solution and thus contradicting its 
minimality. 

it is an instance of a unifier solution: Then S' = $[S"'] for some unifier solution S" and 
non-erasing, non-permutating morphism <f> which is constant on V. Observe that S = H[Q[S"}}. 
We claim that S = &[H[S"]\: indeed, this follows from the fact that H only prepends and 
appends letters from T, which are not affected by (/>. Since H[S"] has a free letter, this makes 
S' an instance of a H[S"], contradiction. □ 

We now investigate the inverse operator associated with PairComp: 

Lemma 23 (cf. Lemma|H|. PairComp(T^, T r ) transforms and preserves minimality of the solutions. 

Let H be the inverse operator of the Pop applied in PairComp, furthermore let PairComp replaced 
pairs ab € T^rv with c- ab > . Then the corresponding family of inverse operators consists of only one 
operator: 

(3) Ho J] H c(ab) _> ab . 

ab£F e r r 

Proof. Observe that by Lemma [221 the application of Pop transforms the solutions and preserves their 
minimality; furthermore, the inverse operator for it is well-described. So it is left to consider the 
compression of pairs performed by PairComp. 

Note that such a compression is a composition of many PairCompNCr, so it is enough to show that 
PairCompNCr(a, b) preserves minimality and when it transforms U = V to U' = V' with S' , then 
H~_^ ab is its inverse operator. 

Suppose that S is a solution of U = V. Observe that for appropriate non-deterministic choices 
(done in Pop) the pair ab is noncrossing, see Lemma [5) Moreover, the compressions of a pair a'b' 
cannot make ab crossing, as a'b' do not overlap with ab, the letter that replaced a'b' is not a and 
not b and lastly no letters are popped from the variables during the compression of non-crossing 
pairs. Then by Lemma HI the equation U' = V returned by PairCompNCr(a, b) has a solution S' 
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such that S(U) = h c ^ a b(S'(U')). Since PairCompNCr(a, b) does not modify variables, this means that 
S = H^ablS'], as claimed. 

On the other hand, suppose that S' is a solution of U' = V returned by PairCompNCr(a, b). Define 
S = H c ^ a i, [S 1 ] . Then clearly it is a solution of U = V: note that S(U) = h c ^. a b(S'(U')), as each 
explicit c in U' was obtained by replacing ab by PairCompNCr(a, b), while each implicit c in S'(U') was 
replaced by ab by H c ^ a b. In the same way S(V) = h c ^. a b(S'(V')), which shows that S is a solution of 
U = V. 

It is left to show that PairCompNCr(a, b) preserves minimality. Suppose it does not. Let S be 
a minimal solution of U = V and suppose that U = V with S is transformed into U' = V with a 
solution S', which is not minimal. There are two cases: either S' is a unifier solution, or it is an instance 
of the unifier solution; we consider only the latter, the former is shown using a similar argument. Then 
on one hand S = H c ^ a b[S'] and on the other S' = $[5"] for morphism (j> : T U V i— > (T U T') + which 
is non-erasing, non-permutating and constant on T. Then S = {H c _^ a b o <J>)[S"'] and it is left to show 
that if c _j. a b o $ corresponds to some non-erasing non-permutating morphism (f)' . Define 4>'{x) = x for 
x G r and 4>'(x) = (h c ^ a b ° 4>){ x ) f° r x £ T'. It is easy to observe that <\> is non-erasing: indeed, both 
4> and h c ^ a b are non-erasing and so their composition is also non-erasing. Clearly it is constant on T. 
Lastly, it is non-permutating: if there is x £ V such that (j)(x) j^ V then 4>'{x) = (h c ^ a b o <j)){x) ^ V 
and so 4>' is non-permutating. If for each x £ T' it holds that 4>{x) = x then 4>'(x) = (p(x), as c ^ T', 
hence (/>' is non-permuting, since cp is not. □ 

Block compression. For block compression BlockCompImp, the family of inverse operators is quite 
complicated. Instead of introducing it and proving its properties in one go, we choose to make some 
intermediate steps, which hopefully make a smoother presentation. We begin with describing the 
family inverse operators for BlockCompNCr, CutPrefSuff and finally for BlockComp; for each of the 
this family consists of a single operator. Then we show that a run of BlockCompImp represents several 
runs of BlockComp, and so in some sense it is a 'parametrised' BlockComp. In particular, the family 
of inverse operators for BlockCompImp represent (in a parametrised way) several inverse operators for 
different runs of BlockComp. 

Lemma 24. CutPrefSuff transforms the solution and preserve their minimality. The corresponding 
inverse operator is 

Y[ Prepend^ x o Prepend 6 ^ x , 

XGX x 

where a-% ftfjc ) * s ^ e P re fi x ( su ffi x > respectively) removed from S(X). 

Proof. The proof is similar to the proof of Lemma \T2\ and it is thus omitted. □ 

This allows stating the result for BlockCompNCr: 

Lemma 25. BlockComp transforms the solutions and preserves their minimality. 

Let H be the inverse operator of the CutPrefSuff, then the corresponding inverse operator is 

h o n B a , 

aer 
where b a replaces a with a^. 

Proof. By Lemma [211 the CutPrefSuff transforms the solution and preserves their minimality. Further- 
more, for appropriate choices, there are no crossing blocks in the obtained U' = V with respect to S", 
see Lemma [71 

Observe that afterwards BlockComp is a composition of BlockCompNCr(a) for all letters a. The rest 
of the proof is similar to the one in Lemma 1251 Furthermore, BlockCompNCr preserves minimality, 
which also can be shown as in Lemma E3J □ 

Now we are ready to define the family of operators for BlockCompImp. Intuitively, it will encode 
several inverse operators associated with different nondeterministic choices of BlockComp in a compact 
way: instead of making explicit guesses about the lengths of the prefixes and suffixes of S(X), it 
will parametrise them using variables xx and yx- On the other hand S'(U) contains letters, which 
represent blocks of letters whose length somehow depend on xx and yx- The coherence of all these 
lengths is guaranteed by an appropriate system of Diophantine equations. 
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Suppose that BlockCompImp constructs a linear Diophantine system D in variables {xx,yx}xcx, 
and popped a prefix of ax and suffix of bx from X. (Note that D depends on the nondeterminis- 
tic choices of BlockCompImp.) Then we define a (finite or infinite) family of inverse operators: let 
{ej}™ ! be the expressions on the sides of this system, define a family of operators Tin'- for a solution 
{lx,fx}xex of D define the numbers val(ej) which is obtained by substituting values {lx,fx}xex 
into ej. Then the following operator is in Hb'- 

(4) I [] Prepend^ x o Append^ x I o [] B' 1 , 

\XeX X ' / a£T 

where B~ l replaces the letter a e; by a val ( e i). Other letters a,-, which represent blocks of invisible 
length, are replaced with arbitrary blocks of a's. 

Lemma 26. Let BlockCompImp returns the linear Diophantine system D. Then BlockCompImp trans- 
forms the solutions andTio is the corresponding family of inverse operators. Furthermore, BlockCom- 
pImp preserves minimality of solutions. 

Proof. We know that BlockComp transforms the solutions and that the inverse operators can be read 
out of their code and nondeterministic choices, see Lemma 1251 Our goal is to show that the compression 
performed by BlockCompImp (when transforming U = V to U' = V) is identical (up to renaming of 
fresh letters) to runs of BlockComp that also transform U = V to U' = V; furthermore, the associated 
inverse operator is the same. We shall not formulate the exact conditions on these correspondence, 
instead we use this intuitive idea to prove the claim of the lemma. 

Consider an equation U = V and its solution S. We want to show that for some choices of 
BlockCompImp the obtained equation has a solution S' such that S = H[S'] for some H E H, where 
% is the corresponding family of inverse operators. To this end we use the fact that this is true for 
BlockComp, see Lemma 1251 

Let lx and rx be the (non-zero) lengths ax-prefix and ox-suffix of S(X). Consider a run of 
BlockComp which transforms U = V with S into some U' = V with a solution £", which popped ax- 
prefix and 6x-suffix for X of lengths lx and rx, respectively. Now, consider the run of BlockCompImp 
which left-pops a x x and right-pops bj? from X. Let the maximal blocks obtained in line [5] be 
{Ei, . . . ,Ef.}, let also ei, . . . , e& denote their lengths (expressed in terms of xx and yx)- 

Let us return to run of BlockComp, consider the maximal blocks in the equation obtained by 
BlockComp. Then the i-th. block has length val(ej) equal to ej with lx substitued for xx and rx for 
yx, for each X € X. Guess the partition in BlockCompImp according to the rule: E{ is in the same 
part as Ej if and only if they represent blocks of the same letter and val(ej) = val(ej). Observe, 
that {lx,rx}xex is a solution of the system D returned by BlockCompImp: the i-th block has length 
val(ej) and the equation ej = ej is added if and only if val(ej) = val(ej). 

Note that both BlockCompImp and BlockComp transform U = V to the same equation U' = V (up 
to replacement of letters a ei , which are used by BlockCompImp, by letters a va j( e -), which are used by 
BlockComp). Lastly, consider H, the inverse operator corresponding to this run of BlockComp. Then 
H € Hd'- w e know from Lemma !2o1 that H is equal to 

( \{ Prepend^ x x o Appendix x ) o J[ B a , 

xgx x ' aer 

and as {lx,^x}xex are a solution of D, then straight from the definition, this operator belongs to 
H D . 

We now show that that if an equation U = V is transformed by BlockCompImp into U' = V' 
which has a solution S' and H S %r> then H[S'] is a solution of U = V. Consider the system D 
that was constructed by BlockCompImp on the way, and an operator H € %d- Consider the solution 
{lx,fx}xex to which H corresponds. Consider a run of BlockComp which cuts an ax-prefix (bx- 
suffix) of length lx (rx, respectively) out of the variable X. Also, let BlockComp replace blocks 
corresponding to Ej £ Ei from BlockComplmp(a) by a letter a ei (the names of the fresh letters do 
not matter). As in the earlier part of the proof observe that such BlockComp and BlockCompImp 
transform equation U = V to the same equation U' = V and H is the corresponding inverse operator 
for BlockComp. Let 5" be a solution for U' = V. Then, by Lemma \25\ H[S'] is a solution of U = V, 
as claimed. Furthermore, since BlockComp preserves minimality of solutions, if S is minimal, so is 
S'. Since the choice of 5 was arbitrary, we conclude that BlockCompImp preserves minimality of 
solutions. □ 
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We call the families of operators (J3J) and (JU) proper. 

Representation of minimal solutions. As already said, the set of minimal solutions will be rep- 
resented by a directed graph. 

Definition 3. We say that a word equation U = V is proper if U, V & (T U X) + , the equation is 
satisfiable, in total U and V have at most n v appearances of variables and \U\ + \V\ < en (for an 
appropriately chosen in advance constant c). 

Directed graph Q for a satisfiable input equation Uq = Vq has 

• nodes labelled with proper equations 

• edges of Q are labelled with a proper family of operators. 

• an edge from U = V to U' = V' labelled % if and only if for some nondeterministic choices 
WordEqSat (that uses BlockCompImp instead of BlockComp) transforms U = V into U' = V' 
and % is the corresponding family of inverse operators. 

• each node is reachable from node labelled with Uq = Vq. 

The intuition is as follows: the first type of edges, labelled with correspond to PairComp in 
WordEqSat. The second, labelled with (|3D correspond to BlockCompImp. 

Observe, that a family of inverse operators (jl|) needs to specify, which letters represent compressed 
bocks of as, i.e. if there is an edge labelled with (jl|) from U = V to U' = V then we need to specify, 
which letters in U' = V are the letters a ei . However, this is easily done by listing appropriate pairs: 
letter in U' = V' and name ei to which it corresponds. Furthermore, we need only to specify the e, 
corresponding to blocks with visible lengths, and there are at most C(|C/| + |V|) of them; solutions 
that have blocks with invisible lengths are not minimal, see Lemma [20J Furthermore, a family of 
inverse operators ([3]) needs only to list the letters c that replace pairs ab, which is done in a similar 
way. 

We need to show that on one hand the presented construction can be performed in PSPACE, and 
on the other, that it describes all (minimal) solutions of a word equation. We begin with the latter. 

Lemma 27. Let S be a minimal solution of an equation U = V that is a node in Q, moreover, U = V 
is non-trivial (i.e. \U\ > 1 or \V\ > 1). Then there is an edge labelled with T~i, from U = V to some 
node U' = V such that S = H[S'] for some minimal solution S' of U' = V and H E %. 

Moreover, if there is an edge labelled with % from U = V to U' = V and S' is solution of U' = V 
then H[S'] is a solution of U = V for every H 6 "H. 

If the equation U = V is trivial (i.e. \U\,\V\ < 1) then there is one minimal solution, which is easy 
to describe. 

Proof. Observe that describing all minimal solution of a satisfiable equation U = V such that \U\ = 
\V\ = 1 is easy: 

• if a variable X does not appear in the equation, then S(X) = e for each minimal solution X, 
so it is enough to consider variables that appear in U = V 

• if one of the sides is a letter, then there is only one solution 

• if both sides consist of variables, then the minimal solution assigns e to each variable. 

If U = e or V = e, then the analysis of the minimal solutions can be made in a similar way. 

The rest of the lemma simply summarizes the properties of subprocedures of WordEqSat, presented 
in see Lemma E3 an d ESI which claim that PairComp and BlockCompImp transform solutions and 
preserve their minimality, and the definition of the graph Q. □ 

Constructing the representation of all minimal solutions. We show that it is possible to generate Q 
within PSPACE. In order to do so we should be able to decide in PSPACE whether: 

node label check: a given proper equation U = V labels a node in Q. 

edge label check: given two proper equations U = V and U' = V and a proper family H of 
operators there is an edge from U = V to U' = V labelled with %. 

when PSPACE procedures for these two tasks are known, constructing Q is easy: 

• we iterate over all proper equations (they have length at most en), for a fixed equation U = V 
we check whether U = V labels a node in Q. If so, we output it. 
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• we iterate over all pairs of proper equations U = V and U' = V and every proper family 
of operators H labelling this edge; note that both @ and ([ID can be described using 0(n) 
symbols. For a fixed triple we verify, whether there is an edge from U = V to U' = V labelled 
with H. If so, we output triple (U = V, H, V = V). 

Clearly this procedure uses only polynomial space and properly generates Q. 

It is thus left to show that node and edge label checks can be performed in PS PACE. 

Lemma 28. Node and label checks can be performed in PSPACE. 

Proof. Consider first node label check. It is trivial to verify, whether U = V has length at most en 
and at most n v variables' appearances. Using WordEqSat we can verify in PSPACE, whether U = V 
is satisfiable. Also, in NPSPACE we can verify, whether WordEqSat transforms Uq = Vq to U = V: 
we begin with U$ = Vq and transform it using WordEqSat until U = V is obtained. As by Lemma [U] 
WordEqSat uses O(nlogn) space, this is doable in NPSPACE. As NPSPACE=PSPACE, we are done. 

In a similar way we can show that edge label check can be performed in NPSPACE. Firstly using 
node label check we verify, whether both U = V and U' = V label nodes of Q. The label % uniquely 
identifies the subprocedure of WordEqSat that should be applied to U = V in order to obtain U' = V; 
we thus take U = V and apply this subprocedure. As WordEqSat uses at most nlogra space, by 
Lemma [U]and Lemma [To"! then this can be tested in NPSPACE and consequently edge label check can 
be executed in PSPACE □ 

The presented procedures are enough to construct a finite representation of all minimal solutions. 
It is left to formalise the transformation of minimal unifier solutions to minimal solutions. 

Lemma 29. Let S be a minimal unifier solution of U = V . Then for some nondeterministic choices 
Pop(r', 0) returns an equation U' = V such that S = $>[S'] for some minimal solution S' of U' = V , 
where 

$ = [] Append axX , 

where ax is the symbol left-popped from a variable X, i.e. ax € V U e. 

Proof. We know from Lemma [22] that Pop transforms the solution and the given operator is the 
corresponding inverse operator. Then for a minimal unifier solution S Pop and appropriate guesses 
U = V with S is transformed into U' = V' with S', such that S = <J>[S"]. To be more precise, the 
guesses are consistent with S, in the sense that Pop left-pops a letter ax if and only if the first of 
S(X) is ax and ax £ T'. We need to show that if S is a minimal unifier solution, then S' is a minimal 
solution. The proof is similar to the proof that Pop preserves minimality of solutions. 

We first show that S'(U') has no free letters. By Lemma EH if a is a free letter in S, then a is the 
first letter of some S(X). Then Pop pops a from X, and so a appears as an explicit letter in U' = V' . 
As we choose a arbitrarily, all free letters of S(U) appear in U' = V . As S'(U') = S(U), by LemmaEl 
the S' has no free letters and so it is a non-unifier solution. 

Suppose that S' is not minimal, as the case that it is a unifier solution is already excluded, this 
happens only in the case when S is an instance of some other unifier solution, i.e. there is a unifier 
solution S" such that S' = &[S"] for some non-erasing, non-permutating morphism <f> : (rur')ur" i— > 
(rur') + which is constant on TUT' (the set T" is a new set of free letters, such that r"n (r'UT) = 0). 
We show that this contradicts the assumption that S is minimal. Since $ prepends and appends letters 
from r U r", which are not affected by <f>', it can be concluded that S = $>'[$>[S']]: 

S = $[5'] 
= $[$'[5"]] 
= <£ / [<I>[S'"]] as 4> does not affect letters added by $ . 

It is left to show that <3?[S"'] is a unifier solution: but S" contains letters from T" and <I> only prepends 
letters to S"(X), hence $[£"'] is a unifier solution. As we already know that 0' is non-erasing and 
non-permutating, and constant on TUT', we conclude that S is an instance of 3>[S"'], which contradicts 
the assumption that it is minimal. □ 

Now we are ready to give the proof of the representation Theorem [3] on generating the finite 
representation of all solutions of a word equation. 
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proof of Theorem 0. Consider first a graph representation of minimal solutions of an equation. Each 
node has at most polynomial description, so does the edges. By Lemma [23 it can be checked in 
polynomial space, whether a node is present in the graph and whether an edge (labelled) joins two 
nodes. Since the description of a node (edge) has polynomial size, there are at most exponentially 
many nodes (edges, respectively). 

In order to generate a graph representation of all solutions, we use the approach presented in 
Lemma [23 Given an equation U = V we iterate over equations U' = V of length at most n + n v and 
using at most n free letters, whether Pop can transform U = V into U' = V . If so, output the node 
labelled with U' = V and make a graph representation of all its minimal solutions. The label on the 
edge from U = V to U' = V' is the inverse operator returned by Pop, see Lemma [23 Clearly, this 
procedure still runs in PSPACE, and so also the generated graph has exponential size. □. 

8. Other theoretical properties 

In this section, we give (alternative) proofs of known theoretical properties of word equations, using 
the approach of recompression; the mentioned properties are an exponential bound on the periodicity 
bound and the doubly-exponential bound on the size of the length-minimal solution. 

Exponential bound on exponent of periodicity. As already described in the introduction, expo- 
nential bound on exponent of periodicity, shown by Koscielski and Pacholski [TUj, is one of the most 
often used results on words equations. Their proof follows by first considering so-called P-presentations 
of a string; roughly, given a string w and a primitive word P, a P-presentation is a canonical factori- 
sation of w into powers of P and other strings. Then each power of P is associated with a number and 
treating such numbers as variables leads to a system of satisfiable Diophantine equations. Solutions 
of this system induced solutions of the word equation. In particular, length-minimal solution corre- 
sponded to minimal (in some sense) solution of the Diophantine equation. This is similar to results 
presented in Section [U but considering P-presentations instead of letters makes the argument much 
more involved. 

Using known results on minimal solutions of Diophantine equations and some simple calculus, an 
exponential upper-bound on the exponent of periodicity was shown. The last step of this proof, i.e. the 
estimation of the minimal solution, was relatively easy, while both the P-presentations and reduction 
from P-presentations to a system of equations were quite involved. 

We now show that using local recompression one can obtain exponential upper bound on exponent 
of periodicity relatively easy. 

For a word w the exponent of periodicity per(u>) is the maximal k such that u is a substring of 
w, for some u € T + ; T-exponent of periodicity per r (u>) restricts the choice of u to T. The notion of 
exponent of periodicity is naturally transferred from strings to equations: For an equation U = V, 
define the exponent of periodicity as 

pei(U = V) = max [per(5(t/))] , 

where the maximum is taken over all length-minimal solutions S of U = V; define the T-exponent of 
periodicity of U = V in a similar way. 

We show that an exponential upper bound on T-exponent of periodicity is easy and natural to 
obtain, one can think of it as a restriction of Koscielski and Pacholski original proof to its last part, 
i.e. to estimation of the minimal solution of a system of Diophantine equations. Then we show that 
the compression applied in Word Eq Sat basically preserves the exponent of periodicity, in particular it 
reduces the calculation of upper bound on per(U = V) to calculation of upper bound on per r (C7 = V). 

Minimal solutions of linear Diophantine systems. Consider a system of m linear Diophantine equations 
in r variables x±, . . . , x r , written as 

r 

(5a) 2J n i,j x j = n i for « = 1, . . . , m 

together with inequalities guaranteeing that each Xi is positive 

(5b) Xj > 1 for j = 1, . . . , r . 

In the following, we are interested only in natural solutions, i.e. the ones in which each component 
is a natural number; observe that inequality (|5bp guarantees that each of the component is greater 
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than zero. We introduce a partial ordering on such solutions: (q±, . . . , q r ) > (q[, . . . , q' r ) if and only 
if qj > q'j for each j = 1, . . . ,r. A solution (gi, . . . , q r ) is a minimal if it satisfies © and there is no 
solution smaller than it. (Note, that there may be incomparable minimal solutions.) 
It is known, that each component of the minimal solution is at most exponential: 

Lemma 30 ( |10|, Corollary 4.4]). For a system of linear Diophantine equations (0) letw = r+YlTLi \ n i\ 
and c = YaLi Ej=i \ n i,j\- If (<Zl> • • • > Qr) is its minimal solution, then qj < (w + r)e c ' e . 

The proof is a slight extension of the original proof of Koscielski and Pacholski, which takes in to 
the account also the inequalities. For completeness, we recall its proof, as given in |1U| . 

proof, cf. [TO] . The proof follows by estimation based from work of von zur Gathen and Sieveking 
and independently by Lambert 



Claim 4 (von zur Gathen and Sieveking |25j : Lambert |12j). Consider a (vector) equations and 
inequalities Ax = B, Cx > D with integer entries in A, B, C and D. Let M be the upper bound on 

A 



the absolute values of the determinants of square submatrices of the matrix I „ j , r be the number 

of variables and w the sum of absolute values of elements in B and D. Then for each minimal natural 
solution (gi,..., q r ) of ([5]), for each 1 < i < r we have qi < (w + r)M. □ 

So it remains to estimate M from Claim [H Observe that as the matrix C in our case is an identity, 
it is enough to consider the bound on the values of determinants of square submatrices of (riij), which 
was done by Koscielski and Pacholski |10j . 



Recall the Hadamard inequality: for any matrix N = (riij)^ =1 we have 

k k 



det 2 (iV)<nE<- 



And therefore 




det(iV) < 1 [ y2 n i j Hadamard inequality 



trivial 



simplification 

( 'E?=i(Eii|n^|)' 

< — — inequality between means 



f c\ k k k 

< - by definition Y^ Y] 



n 



'■./ 1 



j=l i=\ 

< e c ' e calculus: sup at k = c/e. 

Taking iV to be any submatrix of (nij) yields that M < e c ' e and consequently q% < {w + r)e c ' e , as 
claimed. □ 

Now from Lemma I5UI it can be easily concluded that 

Lemma 31. In each minimal solution of the small system of linear Diophantine equations for word 
equation U = V all coordinates are 0((\U\ + \V\)e 2riv ^ e ). 

Lemma 32 (cf. [TO], cf. [T]). Consider the solutions {ix,fx}xGX of ([2D and the corresponding induced 
solutions S' of the equation U = V. The Y -exponent of periodicity of length-minimal S' among them 
is 0(n v (\U\ + \V\e 2n -/ e )), while per r (S'(X)) for any variable X is 0((\U\ + \V\)e 2n -/ e ). 
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Proof. By Lemma Q3] all such solutions are similar. Take the length-minimal solution S' among 
them. Let £' x , (r' x ) be the lengths of the ax-prefix (ox-suffix) of S'(X). Then {£' x ,r x }xe.x is a 
solution of (0). Note that it is a minimal solution of this system: suppose that it is not. Then 
there is a solution {£ x ,r x }x£X of this system, such that £" x < £' x and r' x < r' x and one of these 
inequalities is strict. Then, again by Lemma UM the corresponding S" is also a solution of the 
word equation and it has all blocks of letters not longer than S', furthermore one of these blocks is 
strictly shorter and thus S"(JJ) is shorter than S'(U), thus S' is not length-minimal, contradiction. 
Hence, {£ x ,r x }xex is C((|L^| + |U|)e 2ni, / e ), by Lemma EU Now the maximal length of ej is clearly 
Exex £'x + r' x + \U\ + \V\ - n v , which is 0(n v (\U\ + \V\)e 2n */ e ), D 

So far we have only shown that T-exponent of periodicity is at most exponential. However judging 
by the work of Koscielski and Pacholski [TU], the difficulty is elsewhere, in the case of exponent of 
periodicity for longer words. We show that this is not the case: in the following lemma we show that 
recompression reduces the general exponent of periodicity to T-exponent of periodicity. Unfortunately, 
our result is weaker than the one obtained by Koscielski and Pacholski, as they in fact had a 2 cn bound, 
for appropriate c. 

Lemma 33. Let U = V with a solution S be transformed by some subprocedure of WordEqSat, i.e. 
PairComp or BlockComp (or BlockComplmpj into U' = V with S' . Then pei(S'(U')) < per(S(U)). 
Furthermore 

(per 1) per(S(U)) = per r (S(U)), or 

(per 2) per(S'(U')) > per(S(U)) - 1 . 

Proof. Recall that by Lemmata O and [7] for Pop and CutPrefSuff it holds that S(U) = S'(U') and so 
the claim trivially holds, as per(S'(U')) = per(S(U)). So it is enough to show the claim for PairComp 
and BlockComp restricted to compression (the analysis for BlockCompImp is the same). 

We first show that per (S(U)) > per(S'(U')) for PairComp and BlockComp. For the former ob- 
serve, that is it a composition of Pop, which preserves the exponent of periodicity, and of several 
PairCompNCr, so it is enough to show the inequality for PairCompNCr. By Lemma H] it holds that 
S'(U') = h~_^ ab (S(U)). Since c does not appear in S(U), we conclude from this that S(U) = 
h c -+ab{S' {U')) . Let w k be a substring of S'(U') : then h c ^ a b(w k ) = {h c ^ a },{w)) k is a substring of 

5(17), hence per(5'(C/')) < per(5(tf)). 

Similarly, for BlockComp is a composition of CutPrefSuff, which preserves the exponent of periodicity, 
and of several BlockCompNCr(a), so it is enough to show the inequality for BlockCompNCr(a). By 
Lemma H] for BlockCompNCr(a) it holds that S'(U') = b~ 1 (S(U)). Since all letters at introduced by 
6" 1 do not appear in S(U), it follows that S(U) = b a (S'(U')). Consider any w k that is a substring of 
S'(U'). Then b a {S'{w k )) = (b a (S'(w))) k is a substring of S(U). Thus per(S'(C/ / )) < per(5(C/)). 

We move to the second claim of the lemma, i.e. we are going to show that ( |per ip or ( |per 2p holds. 



Let 77i = -pex(S(U)). If there is a G T such that a p is a substring of S(U), then ( |per ip holds. So 
assume that w m is a substring of S(U), for some w ^ T. Moreover, we can assume that w ^ a k for 
every a and k, as this clearly reduces to the case of w = a. 

Consider first Pa irComp(Lettersi, Letter), and let w = bua, recall that by the assumption |tu| > 1 
and so \u\ > 0, i.e. it can be that u = e. How does the image of w m looks like in S'(U')? This 
depends on whether a G Lettersi and whether b G Letters2, in total there are four cases. Let 
h = Ilaer £ ,ber r h c ^ ah - 

b £ Letters2' The further analysis depends on whether a € Lettersi or not 

a £ Lettersi: In this case, h~ l (w m ) = (h~ 1 (w)) m and so (h~ 1 (w)) m is asubstringof h~ 1 (S(U)); 

thus, pev(S'(U')) > per(S(U)). 
a G Lettersi: In this case it might be that the last letter of w m is compressed with the 
following letter. Still, when considering the prefix u;" 1-1 , we can make the analysis as in 
the previous case (since the last letter of w 771 " 1 is not compressed). Hence, per(S'(U')) > 
per(S(U)) - 1. 
b G Letters2'- Similarly, the further analysis depends on whether a G Lettersi or not 

a $l Lettersi: As in the previous subcase, the first letter of w m might be compressed, how- 
ever, when considering the suffix w m ~ l of w m we know, that its first letter is not com- 
pressed. Then /i" 1 ^" 1 " 1 ) = h^iw)" 1 " 1 is a substring of S'(U'), thus per(5'(C/')) > 
per(S(C/)) - 1. 
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a € Lettersi: Represent w m as b(uab) m ~ l ua. Then the analysis is similar as in the first 
subcase: the last letter of uab is not a, moreover, even though the first letter of uab may 
be 6, this is not preceeded by a and so it is not compressed. Hence, (h" 1 (uab)) 771 " 1 is a 
substring of S'(U') and so per(S'{U')) > per(5(C/)) - 1. 

The analysis for BlockComp is even simpler: let w = tfua r , where u does not begin with b and does 
not end with a. Let 6 _1 = ]laer ^a ■ Then 

As by the assumption u does not start with b, hence S'(U') contains b~ 1 ((ud r b e ) m ~ 1 ) = (b" 1 (ual 7/)) m_1 , 
and so per(S"(C/')) > m — 1, as claimed. □ 

Corollary 1 (cf. [TU], cf. Lemma [TJ. 27ie exponent of periodicity of equation of a length-minimal 
solution S is single exponential in \U\ + \V\. 

Proof. Let U\ = Vi, U2 = V2, ■ ■ ■ , U m = V m be all equations generated during the run of WordEqSat, 
in this order. Clearly, none of these equations is repeated, and as each of them is of length at most 
c'n (see Lemma EJ), thus m < (c'n) cn+1 < n cn , for some constants c and d. By Lemma 1551 it holds 
that per(CTj) = per r (C/j) or per(£/j) < per([/j + i) + 1; moreover per(£/ m = V m ) < 1, as \U m \, \V m \ < 1. 
As by Lemma [TS1 it holds that 

per r (^)<c / n„(|^,| + |^|)e 2n "/ e 

for some constant c', we conclude that 

per(E/i) = 0(n v ne 2riv/e + n cn ) = 0(n cn ) , 
for some constant c, in particular it is single exponential in n = \Ui\ + |Vi|. □ 

Double exponential bound on minimal solutions. It was shown by Plandowski [TB] that the 
size of the length minimal solution of word equation is always doubly exponential. This result was 
achieved by careful and clever analysis of factorisations of minimal solutions; the proof is basically 
independent from the analysis of the PlaSat, though uses similar types of factorisations of words. 

Since we know that on one hand the running time of WordEqSat is polynomial in n and log N, on 
the other the space consumption is C?(n log n), the doubly exponential upper bound on logN seems 
natural. However, both presented bounds are upper bounds and so cannot be directly compared. To 
compare them we want to show that the running time is in fact also lower-bounded in terms of log N. 

Lemma 34. Let N be the size of the length-minimal solution. Then the number of phases of WordE- 
qSat is 0(log N/po\y(n)) for every accepting run, regardless of the nondeterministic choices. 

Proof. Suppose that the equation U = V is transformed into an unsatisfiable equation U' = V; then 
we are done, as it will never be turned into a satisfiable instance. So in the following we consider only 
the case, in which each of the equations is satisfiable. 

The solution S' of U' = V is obtained from S of U = V by two separate compression sub-phases: 
in first, maximal blocks of letter are compressed into one letter. In second, some pairs ab, for a ^ b 
are replaced by a fresh letter. The second phase is easier to analyse, let us focus on it. Notice, that 
if c is introduced as a letter for a pair ab then c is not compressed in the rest of this phase, hence at 
most two letters are compressed into one and those new letters are not further compressed. Thus, the 
second compression phase shortens the string S(U) by a factor of at most 2. Let us return to the first 
sub-phase. 

Consider any length-minimal solution S' of U' = V, let its length be N'. Consider all possible 
solution S of U = V which are transformed into S' by BlockComplmp. They correspond to solutions 
{@x,rx}xex of ^) and by Lemma 1521 the length of the longest block of letters in S(X) (i.e. the 
T-exponent of periodicity) of length-minimal S among them is C((|C/| + |y|)e 2n "' e ). Consequently 

\S{U)\/\S'{U')\ = \S{U)\/N' < cne 2n " /e . 

Since |5*(C/)| > N, where N is the length of the length-minimal solution of U = V, we obtain that 
N/N' < cn v e 2riv ' e . Adding the factor two in the second sub-phase we obtain the upper bound on 
2cne 2riv ' e on the compression ratio. The rest of the calculations follows easily. □ 
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Corollary 2 (cf. [18J). The size of the length-minimal solution is at most 2 q ^ n ' nv v for some polynomial 
q and constant c. 

Proof. By LemmaOthe equation stored by WordEqSat has at most 0(n£ n " log n) many phases. On the 
other hand, by LemmaEMl there are at least c'(log N)/p(n) phases, for some constant d and polynomial 
p. Thus, 

d (log N)/p{n) < n£ n "logn , 
which yields the claim. □ 
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